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Preface 


Our purpose in writing a multivariable calculus text has been to help students learn 
that mathematics is the language in which scientific ideas can be precisely for¬ 
mulated and that science is a source of mathematical ideas that profoundly shape 
the development of mathematics. 

In calculus, students are expected to acquire a number of problem-solving 
techniques and to practice using them. Our goal is to prepare students to solve 
problems in multivariable calculus and to encourage them to ask, Why does cal¬ 
culus work? As a result throughout the text we offer explanations of all the 
important theorems to help students understand their meaning. Our aim is to foster 
understanding. 

The text is intended for a first course in multivariable calculus. Only knowledge 
of single variable calculus is expected. In some explanations we refer to the fol¬ 
lowing theorems of calculus as discussed for example in Calculus With Applications , 
Peter D. Lax and Maria Shea Terrell, Springer 2014. 

• Monotone Convergence Theorem A bounded monotone sequence has a limit. 

• Greatest Lower Bound and Least Upper Bound Theorem A set of numbers 
that is bounded below has a greatest lower bound. A set of numbers that is 
bounded above has a least upper bound. 

Chapters 1 and 2 introduce the concept of vectors in W 1 and functions from W 1 to 
M m . Chapters 3 through 8 show how the concepts of derivative and integral, and the 
important theorems of single variable calculus are extended to partial derivatives 
and multiple integrals, and to Stokes’ and the Divergence Theorems. 

To do partial derivatives without showing how they are used is futile. Therefore 
in Chapter 8 we use vector calculus to derive and discuss several conservation laws. 
In Chapter 9 we present and discuss a number of physical theories using partial 
differential equations. We quote a final passage from the book: 

We observe, with some astonishment, that except for the symbols used, the 
equations for membranes in which the elastic forces are so balanced that they do not 
vibrate, and heat-conducting bodies in which the temperature is so balanced that it 
does not change, are identical. 


VI 


Preface 


There is no physical reason why the equilibrium of an elastic membrane and the 
equilibrium of heat distribution should be governed by the same equation, but they 
are, and so 

Their mathematical theory is the same. 

This is what makes mathematics a universal tool in dealing with problems of 
science. 

We thank friends and colleagues who have given us encouragement, helpful 
feedback, and comments on early drafts of the book, especially Louise Raphael of 
Howard University and Laurent Saloff-Coste and Robert Strichartz of Cornell 
University. We also thank Cornell students in Math 2220 who suggested ways to 
improve the text. We especially thank Prabudhya Bhattacharyya for his careful 
reading and comments on the text while he was an undergraduate Mathematics and 
Physics major at Cornell University. 

The book would not have been possible without the support and help of Bob 
Terrell. We owe Bob more than we can say. 

New York, USA Peter D. Lax 

Ithaca, USA Maria Shea Terrell 
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Chapter 1 

Vectors and matrices 


Abstract The mathematical description of aspects of the natural world requires a 
collection of numbers. For example, a position on the surface of the earth is 
described by two numbers, latitude and longitude. To specify a position above the 
earth requires a third number, the altitude. To describe the state of a gas we have 
to specify its density and temperature; if it is a mixture of gases like oxygen and 
nitrogen, we have to specify their proportion. Such situations are abstracted in the 
concept of a vector. 


1.1 Two-dimensional vectors 


Definition 1.1. An ordered pair of numbers is called a two-dimensional vec¬ 
tor. We denote a vector by a capital letter 


U = (mi, U 2 ). 


The numbers u\ and U 2 are called the components of the vector U. The set of 
all two-dimensional vectors, denoted R 2 , is called two-dimensional space. 

We introduce the following algebraic operations for two-dimensional vectors 

(a) The multiple of a vector U = {u \, uj) by a number c, cU, is defined as the vector 
obtained by multiplying each component of U by c: 


cU = (CU\,CU2). 


( 1 . 1 ) 


(b) The sum of vectors U = (u \, uj) and V = (vi, V 2 ), U + V, is defined by adding the 
corresponding components of U and V: 


U + V = (u\ + V \, U2 + V2). 


( 1 . 2 ) 
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1 Vectors and matrices 


We denote (0,0) as 0 and call it the zero vector. Note that U + 0 = U for every 
vector U. The symbol -U denotes the vector (-u\,-U 2 ). The vector V-U defined 
as V + (-U) is called the difference of V and U. 

Multiplication by a number (or scalar) and addition of vectors have the usual 
algebraic properties: 


u+v = v+u commutative 
(U + V) + W = U + (V + W) associative 
c(U + V) = cU + cV distributive 
(a + b) U = a\J + b\J distributive 
U + (—U) = 0 additive inverse 


In Problem 1.6 we ask you to verify these properties. Vectors U = (v,y) can be 
pictured as points in the Cartesian x,y plane. See Figure 1.1 for an example of two 
vectors (3,5) and (7,2) and their sum. 



By visualizing vectors as points in the plane, multiplication of a vector U by a 
number c and the addition of two vectors U and V have the following geometric 
interpretation. 

(a) For a nonzero vector U and a number c, the point cU lies on the line through the 
origin and the point U. Its distance from the origin is \c\ times the distance of U 
from the origin. The origin divides this line into two rays; when c is positive, U 
and cU lie on the same ray; when c is negative, U and cU lie on opposite rays. 
See Figure 1.2. 

(b) If the points 0, U, and V do not lie on a line, the four points 0, U, V, and U + V 
form the vertices of parallelogram. (We ask you to prove this in Problem 1.7.) 
See Figure 1.3. 

(c) For c between 0 and 1 the points V + cU lie on the line segment from V to V + U. 
That side of the parallelogram is parallel to the segment from 0 to U and has 
the same length. The directed line segment from V to V + U is another way to 
visualize the vector U. See Figure 1.4. 

We can visualize vector addition and multiplication by a number in two dimen¬ 
sions. But we will see in Section 1.4 that in dimensions higher than three it is the 
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Fig. 1.2 Points 0, U, and cU are on a line, c > 0. 



Fig. 1.3 0, U, U + V, and V form a parallelogram. 



Fig. 1.4 For 0 < c < 1 the points V + cU lie on a line segment from V to V + U. 


algebraic properties of vectors that are most useful. Two basic concepts we will use 
are linear combination and linear independence. 


Definition 1.2. A linear combination of two vectors U and V is a vector of 
the form 

a\J + b\ (1.3) 

where a and b are numbers. 


Example 1.1. The vector 


U =(5,3) 


is a linear combination of (1,1) and (-1,1) since 


U = 4(1,1) - (-1,1). 


□ 
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1 Vectors and matrices 


Example 1.2. Every vector (v,y) is a linear combination of (1,0) and (0,1) 
since 


(*,y) = *(l,0)+y(0,l). 


□ 

You might wonder if every vector in R 2 can be obtained as some combination of 
two given vectors U and V. As we will see in Theorem 1.1, the answer depends on 
whether or not U and V are linearly independent. 


Definition 1.3. Two vectors U and V are called linearly independent if the 
only linear combination aU + b\ of them that is the zero vector is the trivial 
linear combination with a - 0 and b = 0. 


Example 1.3. Are the vectors (1,0) and (0,1) are linearly independent? Sup¬ 
pose 

a(l,0) + fe(0,l) = (0,0). 

Then (a, 0) + (0 ,b) = ( a,b ) = (0,0). This implies 

a = 0, b = 0. 

Therefore the only linear combination of (1,0) and (0,1) that is (0,0) is the 
trivial one 0(1,0) + 0(0,1). The vectors (1,0) and (0,1) are linearly indepen¬ 
dent. □ 

Two vectors are called linearly dependent if they are not independent. 

Example 1.4. Are U = (1,2) and V = (2,4) linearly independent? Suppose 
fl (l,2) + i(2,4) = (0,0). 

Then ( a + 2b,2a + 4b) = (0,0). This is true whenever a = -2b. For example 
take a- 2 and b = -1. Then 

2U + (-l)V = 2(1,2) + (-1)(2,4) = (0,0). 

The vectors (1,2) and (2,4) are linearly dependent. □ 

The next theorem tells us that if C and D are linearly independent then we can 
express every vector U in R 2 as a linear combination of C and D. 

Theorem 1.1. Given two linearly independent vectors C and D in R 2 , every 
vector U in R 2 can be expressed uniquely as a linear combination of them: 

U = aC + bD. 


Proof. Neither of the vectors is the zero vector, for if one of them, say C were, then 
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1C + 0D = 0 

would be a nontrivial linear combination of C and D that is the zero vector. Next we 
show that at least one of the vectors C or D has nonzero first component. For if not, 
C and D would be of the form 


C = (0,C2), D = (0,d2)> r*2 i 1 0, d,2±0. 

But then J 2 C - cfO = (0, ^ 2 ^ 2 ) - (0,C2^2) = (0,0) = 0, a nontrivial linear relation 
between C and D. 

Suppose the first component c\ of C is nonzero. Then we can subtract a multiple 
a - — of C from D and obtain a vector D' whose first component is zero: 


D' = D-aC, 

say D' = (0, d). Since D' is a nontrivial linear combination of C and D, D' is not the 
zero vector, and d± 0. We then subtract a multiple of D' from C to obtain a vector 
C' whose second component is zero and whose first component is unchanged: 

C' = C-M)' = (c!,0). 

Since c\ and d are not zero, every vector U can be expressed as a linear combination 
of C' and D'. Since C' and D' are linear combinations of C and D, so is every linear 
combination of them. 

To check uniqueness suppose there were two linear combinations of C and D for 
a vector U, 

U = aC + bD = a'C + b'D. 


Subtract to get 

(a-a')C + (b-b')D = 0. 

Since C and D are linearly independent, this linear combination must be the trivial 
one with 

a -a' = b-b' =0. 

That proves a! - a and b' = b. This completes the proof of Theorem 1.1. □ 

A basic tool for studying vectors and functions of vectors is the notion of a linear 
function. 


Definition 1.4. A function £ from R 2 to the set of real numbers R whose input 
U is a vector and whose value £(U) is a number is called linear if 

(a) €(cV) = c€( U) and 

(b) a;u+v) = a;u)+a;v) 

for all numbers c and vectors U and V. 
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Combining these two properties of a linear function £ we deduce that for all 
numbers a , b and all vectors U, V 

£(a\J + b\) = £(dV) + £(b\) = a£(V) + b£(V). (1.4) 


Theorem 1.2. A function l from R 2 to R fv linear if and only if it is of the form 

£(x,y) = px + qy. (1.5) 

for some numbers p and q. 


Proof Suppose £ is linear. Take Ei and E 2 to be the vectors (1,0) and (0,1). We can 
express the vector (x,y) as xE\ + yE 2 . By linearity 

£(x,y) = £(xE\ +yE 2 ) = x£(E\) + y£(E 2 ). 

Let p - £(Ei) and q = £(E 2 ), then £(x,y) = px + qy for all (x,y) in R 2 . 

Conversely, we ask you in Problem 1.12 to show that every function of the form 
£(x,y) = px + qy is linear. □ 


Problems 

1.1. Use a ruler to estimate the value c shown in Figure 1.2. 

1.2. Make a sketch of two linearly dependent nonzero vectors U and V in R 2 . 

1.3. Let U = (1,-1) and V = (1,1). 

(a) Find all numbers a and b that satisfy the equation 

a\J + b\ = 0. 

Prove that U and V are linearly independent. 

(b) Express (2,4) as a linear combination of U and V. 

(c) Express an arbitrary vector (v,y) as a linear combination of U and V. 

1.4. Find a number k so that the vectors (k, -1) and (1,3) are linearly dependent. 

1.5. Find a linear function £ from R 2 to R that satisfies £(1,2) = 3 and £(2,3) = 5. 

1.6. Let U = ( U\,U 2 ), V = (vi,V 2 ), and W = (w\,W 2 ) be vectors in R 2 and let a , b , 
and c be numbers. Use the definitions U + V = (u\ + v\ , U 2 + V 2 ), cU = (cu\ , cuf), and 
-U = (-mi, -M 2 ) to show the following properties. 


(a) U + V = V + U 
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(b) U + (V + W) = (U + V) + W 

(c) c(U + V) = cU + cV 

(d) (a + b)X} = a\J + bU 

(e) U + (-U) = 0 

1.7. Suppose that the points 0 = (0,0), U = ( u\,U 2 ), and V = (vi, V 2 ) do not all lie on 
a line. Show that the quadrilateral with vertices 0, U, U + V, and V is a parallelogram 
by proving the following properties. 

(a) the line through 0 and U is parallel to the line through V and U + V, 

(b) the line through 0 and V is parallel to the line through U and U + V. 

1.8. (a) Make a sketch of two nonzero vectors U and V in R 2 such that U is not a 
multiple of V. 

(b) Using U and V from part (a) make a sketch of the vectors U + V, -V, and U - V. 



Fig. 1.5 Vectors in Problem 1.9. 


1.9. Three vectors U, V, and W are drawn as directed segments between points in the 
plane in Figure 1.5. Express W in terms of U and V, and show that U + V + W = 0. 

1.10. Several vectors are drawn in Figure 1.6 as directed segments between points 
in the plane. 

(a) Express Y as a linear combination of U and V and verify that U + V + Y = 0. 

(b) Express Y as a linear combination of W and X and verify that W + X - Y = 0. 

(c) Show that U + V + W + X = 0. 



Fig. 1.6 A polygon of vectors in Problem 1.10. 


1.11. Let U = (ui,U 2 ). Show that the function £(U) = u\- 8^2 is linear. 



1 Vectors and matrices 


1.12. Let £ be a function from R 2 to R of the form £(x,y) - px + qy where p and q 
are numbers. Show that £ is linear by showing that for all vectors U and V in R 2 and 
all numbers c , the following properties hold. 

(a) £(cV) = c£( U) 

(b) £(U + V) = £(XJ) + £(Y) 

1.13. Write the vector equation 

(4,5) = a(l,3) + fe(3,l) 

as a system of two equations for the two unknowns a and b. 

1.14. Consider the system of two equations for the two unknowns x and y, 

3x + y=0 
5x+ 12y = 2. 

The word “system” means that we are interested in numbers x,y that satisfy both 
equations. 

(a) Write this system as a vector equation xU + yV = W. 

(b) Solve for x and y. 

1.15. Let U = (1,2) and V = (2,4). Find two ways to express the vector (4,8) as a 
linear combination 

(4,8) = aU + 6V. 

Are U and V linearly independent? 

1.16. Consider the vectors U = (1,3) and V = (3,1). 

(a) Are U and V linearly independent? 

(b) Express the vector (4,4) as a linear combination of U and V. 

(c) Express the vector (4,5) as a linear combination of U and V. 



Fig. 1.7 Three points as in Problem 1.17. 
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1.17. Let U, V, and W be three points on the unit circle centered at the origin of R 2 , 
that divide the circumference into three arcs of equal length. See Figure 1.7. 

(a) Show that rotation by 120 degrees around the origin carries U + V + W into itself. 
Conclude that the sum of the vectors U, V, and W is 0. 

(b) Conclude that 

sin(#) + sin(#+ ^) + sin(#+ y) = 0 

for all 6. 

n 

(c) Show that ^ cos (6 + = 0 for all 6 and all n- 2,3,.... 

k= l 

1.18. Let /(U) be the distance between the points U and 0 in R 2 . 

(a) For what numbers c is f(cU ) = c/(U) true? 

(b) Is / a linear function? 

1.19. Suppose / is a linear function and /(-.5,0) = 100. Find /(.5,0). 

1.20. Suppose / is a linear function and /(0,1) = -2, /(1,0) = 6. 

(a) Find /(1,1). 

(b) Find f(x,y). 


1.2 The norm and dot product of vectors 


Definition 1.5. The norm of U = (x,y), denoted as ||U||, is defined as 

l|U|| = JF+f. 


A unit vector is a vector of norm 1. 

Applying the Pythagorean theorem to the right triangle whose vertices are (0,0), 
(v,0), and (v,y), (see Fig. 1.8), we see that the norm of (v,y) is the distance between 
(v,y) and the origin. The norm of U is also sometimes called the length of U. 

Example 1.5. The norm of U = (1,2) is 

||U|| = Vl 2 + 2 2 = Vs. 


The norm of V = -^) is 


so V is a unit vector. 


IIVII = V (^) 2 + (#) 2 = 1 , 


□ 
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A concept related to norm is the dot product of two vectors U and V. 

Definition 1.6. The dot product of U = ( u\,U 2 ) and V = (vi,V 2 ) is 

U • V = U\V\ + U 2 V 2 - (1.6) 


The dot product has some of the usual properties of a product. 

(a) Distributive: for vectors U, V, and W 

U • (V + W) = U • V + u • w, 

(b) Commutative: for vectors U and V 


u v = v u. 


In Problem 1.21 we ask you to verify the distributive and commutative properties. 

It follows from Definitions 1.5 and 1.6 of the norm and dot product that the dot 
product of a vector with itself is its norm squared: 

U-U = ||U|| 2 . (1.7) 

We have shown in Theorem 1.2 that every linear function from R 2 to R is of the 
form £(U) = £(x,y) = px + qy. This result can be restated in terms of the dot product: 

Theorem 1.3. A function l from R 2 to R is linear if and only if it is of the form 

£(U) = C-U, 

where C = ( p , q) is some vector in R 2 . 


Example 1.6. Let £ be a linear function for which 


£(1,1) = 5 and £(-1,1) =-1. 
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Let’s find the vector C = ( p,q ) so that l(\J) = C • U. By Theorem 1.3 we have 
5 = €(l,l) = (p,q)-(l,l) = p + q, -1 = =-p + q. 

Solving for p and q we get 2q = 4, g = 2, and p = 3, so 
€(x,y) = (3,2)- (v,y) = 3v + 2y. 


□ 

An interesting relation between norm and dot product follows from the distribu¬ 
tive and commutative laws applied to the dot product. Using the distributive law we 
see 


(U-V)-(U-V) = U-(U-V)-V-(U-V) = U-U-U-V-V-U + V-V. 

Using the notation of norm and commutativity of the dot product, U • V = V • U, we 
can rewrite the equation above as 

||U - V|| 2 = ||U|| 2 + ||V|| 2 - 2U • V. (1.8) 

Since ||XJ — V|| 2 is nonnegative, it follows from (1.8) that 

U-V<i||U|| 2 + i||V|| 2 . (1.9) 

We show next that an even sharper inequality holds: 

Theorem 1.4. For all vectors U and V in R 2 the following inequality holds. 

U-V < ||U||||V||. (1.10) 


Proof If either U or V is the zero vector, inequality (1.10) holds, because both sides 
are zero. If both U and V are unit vectors, inequality (1.10) follows from (1.9). For 
all nonzero vectors U and V, 

—U, —V 

nun iivii 

are unit vectors and therefore by inequality (1.9) 

UV 

HUH IIVII ’ 

from which inequality (1.10) follows. □ 

For all vectors U and V, 

o < (IIUII - IIVII) 2 = IIUII 2 + IIVII 2 -2IIUII IIVII. 


Therefore 
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1 Vectors and matrices 


By (1.10) we see that 


U-V<||U||||V||<i||U|| 2 + i||V|| 2 
Thus (1.10) is a “sharper” inequality than (1.9). 


, V 

y N 

\ 


y 


x’ 



X 


Fig. 1.9 Coordinate axes x, y, and x',y'. 


Suppose we replace the coordinate axes by another pair of perpendicular lines 
through the origin. See Figure 1.9. Let x' and / be the coordinates in the new system 
of a vector U whose coordinates in the original system were x and y. Then 


/2 . /2 2,2 
x +y = x +y 


because both sides express the square of the distance between the point U and the 
origin. 

The dot product of two vectors calculated in the new coordinates is equal to their 
dot product calculated in the old coordinates. To see that we note that formula (1.8) 
holds in both coordinate systems. The term on the left and the first two terms on the 
right are norms and therefore have the same value in both coordinate systems; but 
then the remaining term on the right, two times the dot product of U and V, must 
also have the same value in both coordinate systems. 




Fig. 1.10 The angle 6 between vectors U and V. 
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The independence of the dot product of the coordinate system suggests that the 
dot product of two vectors U and V, introduced algebraically, has a geometric sig¬ 
nificance. To figure out what it is we use a coordinate system where the new positive 
x' axis is the ray through the origin and the point U; see Figure 1.10. In this new 
coordinate system the coordinates of U and V are: 

U = (||U||,0), V = (||V||cos0,||V||sin0), 

where 6 is the angle between U and V. That is, the angle 6 between the positive 
x' axis and the line through 0 and V, where 0 < 6 < n. So in this coordinate system 
the dot product of U and V is 


U-V = ||U|| ||V||cos0. 

Since the dot product in our two coordinate systems is the same, this proves the 
following theorem. 

Theorem 1.5. The dot product of two nonzero vectors U and V is the product 
of the norms of the two vectors times the cosine of the angle between the two 
vectors, 

U-V = ||U||||V||cos0. 


In particular if two nonzero vectors U and V are perpendicular, 6= f, their dot 
product is zero, and conversely. When the dot product of vectors U and V is zero we 
say that U and V are orthogonal. 


Problems 


1.21. LetU = {u\,U 2 ), V = (vi,V 2 ), and W = (wi,W 2 ). Prove 

(a) the distributive property U • (V + W) = U • V + U • W. 

(b) the commutative property U • V = V • U. 

1.22. Which vectors are orthogonal? 

(a) ( a,b ), (~b,a) 

(b) (1,-1), (1,1) 

(c) (0,0), (1,1) 

(d) (U), (U) 

1.23. Which of these vectors are unit vectors? 

(a) (|,f) 

(b) (cos 6, sin 6) 

(c) (V8 ,y[2) 

(d) (.8,.2) 
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1.24. Use equation (1.8) and Theorem 1.5 to prove the Law of Cosines: for every 
triangle in the plane with sides a , b, c and angle 6 opposite side c, (see Figure 1.11), 

c 2 = a 2 + b 2 - lab cos 6. 



Fig. 1.11 A triangle in Problem 1.24. 


1.25. Let £ be a linear function from R 2 to R for which €(2, 1) = 3 and €(1, 1) = 2. 
Find the vector C so that £(U) = C • U. 

1.26. Find the cosine of the angle between the vectors U = (1,2) and V = (3,1). 

1.27. Use equation (1.8) to show that for every U and V in R 2 , 

||U + V || 2 = ||U || 2 + ||V || 2 + 2U • V. 

1.28. Let U = (x, y). Find a vector C to express the equation of a line y = mx + b as 
C-U = b. 

1.29. If C and D are orthogonal nonzero vectors, there is a simple expression for a 
and b in a linear combination 

U = aC + M). 

C-U 

(a) Dot both sides of this equation with C to show that a =--. 

IICII 2 

(b) Find a formula for b. 

(c) If (8,9) = a{\ 4) + K - 5 . 5 ). find a. 


tu 



Fig. 1.12 Vectors V and U for Problem 1.30. 





1.3 Bilinear functions 
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1.30. Let U be a nonzero vector and t a number. Let fit) be the distance between a 
point V and the point t\J on the line through 0 and U as shown in Figure 1.12. 

(a) Use calculus to find the value of t that minimizes if it)) 2 . 

(b) Use a dot product to find the value of t that makes the angle a in the figure a right 
angle. 

(c) Confirm that the numbers t that you found in parts (a) and (b) are the same. 

1.31. Express the vectors U = (1,0), V = (2,2) in the coordinate system rotated | 
counterclockwise. 

1.32. A regular octagon is shown in Figure 1.13. It shows vertex P = (c, s) where c 
and s are the cosine and sine of |. 

(a) Show that vertex Q is is,c). 

(b) Show that sin(|) = \ ^2- V2. 



Fig. 1.13 The octagon in Problem 1.32. 


1.3 Bilinear functions 


Definition 1.7. A function b whose input is an ordered pair of vectors U and 
V and whose output is a number is called bilinear if, when we hold U fixed, 
b(U, V) a linear function of V, and when we hold V fixed, Z?(U, V) is a linear 
function of U. 


As we shall see, many important functions are bilinear. 

Example 1.7. Let U = (^ 1 ,^ 2 ), V = (vi, V 2 ), W = (wi,W 2 ), and define the func¬ 
tion b by 


fc(U,V) = MlVi. 
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To see that b is bilinear, we first hold U fixed and check that b( U, V) is linear 
in V. That is, check that for all numbers c and vectors V and W, 

KU,V + W) = fe(U,V) + fe(U, W), Z?(U, cV) = cb( U,V). 

For the first we have 

fe(U,V +W) = uiiyi + wi) = u\V\ + u\W\ = fe(U,V) + &(U,W). 

For the second, 


fe(U,cV) = mi(cvi) = cmivi = cfc(U, V). 

So £ is linear in V. We ask you in Problem 1.33 to show by a similar argument 
that b is linear in U when we hold V fixed. □ 

We saw in Example 1.7 that u\V\ is a bilinear function of (U, V). Similarly u\v 2 , 
u 2 v\, and U2V2 are bilinear. The next theorem describes all bilinear functions. 

Theorem 1.6. Every bilinear function b of U = (mi,W 2 ) and V = (vi,V 2 ) is of 
the form 

Z?(U, V) = eu\v\ + fu\V 2 +gU 2 V\ -\-hu 2 v 2 , (1.11) 

where e, f, g, and h are numbers. 


Proof For fixed V, b(\J, V) is a linear function of U. According to Theorem 1.2, 
b(\J, V) has the form (1.5): 


b(U,\) = pu 1 +qu 2 , (1.12) 

where the numbers p and q depend on V. To determine the nature of this dependence 
first let U = Ei = (1,0) and then U = E 2 = (0,1). We get 

*((1,0), V) = (/?)( 1 ) +(g)( 0 ) = p = p(V), b{( 0 , 1 ), V) = (p)(0) + (*)(l) = g = g(V). 

Since b is bilinear, the functions p and g are linear functions of V. Therefore they 
are of the form 

/*(V) = ev\ + fv 2 , g(V) = gvi + Av 2 , 

where e , /, g, and /z are numbers that do not depend on V. Setting these formulas 
for p and q into formula ( 1 . 12 ) gives 

fc(U, V) = (evi + fv 2 )u 1 + (gv 1 +hv 2 )u 2 , 


which gives 


*(U,V) = ^MlVi + fu\V2+gU2V\ +hu 2 V2. 


□ 



1.4 ^-dimensional vectors 
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We ask you in Problem 1.36 to prove the following theorem. 


Theorem 1.7. A linear combination of bilinear functions is bilinear. 


Example 1.8. The dot product U • V has the properties 


(cU)-V = cU-V, (U + V)-W = U-W + V-W, U-(V + W) = U-V + U-W. 
This shows that U • V is a bilinear function of U and V. Its formula 

U V = U\V\ + U 2 V 2 

is a special case of formula (1.11) in Theorem 1.6. □ 

Example 1.9. Let 

b( U, V) = U 1 V 2 — U 2 V 1 , 

where U = (u\,U 2 ), V = (vi,V 2 ). The terms u\V 2 and U 2 V\ are bilinear. By 
Theorem 1.7 b is a bilinear function. □ 


Problems 


1.33. Let U = (u\,U 2 ), V = (vi,V 2 ). Show that the function b(ll, V) = u\V\ is linear 
in U when we hold V fixed. 

1.34. Let U = (u\,U 2 ), V = (vi,V 2 ). Is the function b(ll, V) = u\U 2 bilinear? 

1.35. Define f(p,q,r,s) = qr+ 3 rp — sp. Sort two of the variables p,q,r,s into a 
vector U and the other two into a vector V to express / as a bilinear function b( U, V). 

1.36. Prove Theorem 1.7. That is, suppose Z?i(U, V) and & 2 (U, V) are bilinear func¬ 
tions and c\,C 2 are numbers; show that the function b defined by 

b( U, V) = ci bi (U, V) + c 2 b 2 ( U, V) 


is a bilinear function. 


1.4 ^-dimensional vectors 

We extend the concepts of vectors and their algebra from two dimensions to n 
dimensions, where n is any positive integer. 
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Definition 1.8. An ordered /i-tuple 


U = 0 Ul,U 2 ,...,U n ) 


of numbers is called an n-dimensional vector. The numbers uj are called the 
components of the vector U, and Uj is called the j-th component of U. The set 
of all ^-dimensional vectors, denoted R n , is called ^-dimensional space. 

The vector all of whose components are zero is called the zero vector and is 
denoted as 0 : 


0 = ( 0 , 0 ,..., 0 ). 


There is an algebra of vectors in R n entirely analogous to the algebra in R 2 described 
in Section 1.1. 

(a) Let U = (u\,U2,...,u n ) and let c be a number. The multiple cU is defined by 
multiplying each component of U by c: 


cU = ( CU\ , CU2 ,..., cu n ). 


(b) The sum of U = (u\ , U2, ..., u n ) and V = (vi, V2,... , v n ) is defined by adding the 
corresponding components of U and V: 


U + V = (u\ + Vi, U2 + V2, • • •, u n + V n ). 


In Problem 1.37 we ask you to verify that R^ has the usual algebraic properties 
c(X + Y) = cX + cY, X + Y = Y + X, X + (Y + Z) = (X +Y) + Z. 
According to the third of these we write X + (Y + Z) as X + Y + Z. 

Definition 1.9. Let k be a positive integer. A linear combination of vectors 
Ui, U 2 ,..., 14 in R n is a vector of the form 


k 



where ci,C 2 ,...,c^ are numbers. The set of all such linear combinations is 
called the span of Ui,U 2 ,...,Ufc. 

A linear combination is called trivial if all the numbers Cj are zero. 
Example 1.10. Let 


U = (3,7,6,9,4), 
V = (2,7,0,1,-5). 




1.4 ^-dimensional vectors 
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The vector 

2U + 3V = (12,35,12,21,-7) 

is a linear combination of U and V. In Problem 1.43 we ask you to show that 
the vector 

(~ l -1 3 1 j) 

is also a linear combination of U and V. □ 


Definition 1.10. The vectors Ui,... ,14 in R n are called linearly independent 
if the only linear combination of them that is the zero vector is the trivial one. 
That is, 

if ciUi + C 2 U 2 H- 1 - Ck\Jk = 0 then c 1 = 0 , C 2 = 0 , ..., c^- 0 . 

If Ui,... ,Ujfc are not linearly independent, they are called linearly dependent. 


Example 1.11. The vectors 

Ei = (1,0,0,0), E 2 = (0,1,0,0), E 3 = (0,0,1,0), E 4 = (0,0,0,1) 
in R 4 are linearly independent because the linear combination 
C 1 E 1 +C 2 E 2 + C 3 E 3 +C 4 E 4 = (C 1 ,C 2 ,C 3 ,C 4 ) 
is the zero vector only when c\ = c '2 = c 3 = c\ = 0 . □ 

Definition 1.11. Let Ui, U 2 ,..., 14 be linearly independent vectors in R n , with 
k < n. Let t\, t 2 ,..., tk be numbers and let U be a vector in R n . The set of all 
vectors of the form 

U + fiUi h- 1 - tjJJk 

is called a k-dimensional plane in R n through U. When k — n - 1 we call it a 
hyperplane. When U = 0 the k-dimensional plane through the origin is called 
the span of Ui,U 2 ,... ,U&. 


Theorem 1.8. n + 1 vectors Ui,U 2 ,...,U n +i in R n are linearly dependent. 


Proof. The proof is by induction on n. Take first n- 1, and let u and v be in R. If 
both u and v are zero then u + v = 0 is a nontrivial linear combination. Otherwise 
( v)u + (-m)v = 0 is a nontrivial linear combination. Therefore the theorem holds for 
n= 1 . 

Suppose inductively that the theorem holds for n - 1. Look at the n -th components 
of the vectors Ui,U 2 ,...,U n +i in R n . If all of them are zero, then there are n+ 1 
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vectors in R n ~ x obtained by omitting the final zeros; so by the induction hypothesis 
they are linearly dependent. 

Consider the case where one of the vectors, call it U w +i, has a nonzero n- th com¬ 
ponent. Subtract a suitable multiple Ci of U n+ i from each of the other vectors U/ so 
that the n-th component of each difference 

U- = U/ - Ci\J n+ i (f = 1,2./i) 

is zero. Omitting the final zero of U- we obtain n vectors V, in R” -1 . According to 
the induction hypothesis they are linearly dependent, that is they satisfy a nontrivial 
linear relation 

n 

2/,:V,=0. 

i-\ 

Adjoining a final zero component to each of the vectors V* gives the nontrivial linear 
relation 

n 

2>u; = o. (i.i3) 

i=l 

Setting U' = U* - c/U w +i into equation (1.13) we get the nontrivial relation 

n 

Y J ki(U i -c i U n+l )= 0. (1.14) 

1=1 

This proves the linear dependence of Ui,...,U w+ i. This completes the proof by 
induction. □ 

Example 1.12. The vectors in R 3 , 

Ei = (1,0,0), E 2 = (0,1,0), E 3 = (0,0,1), X = (2,4,3) 

are linearly dependent: Since 

X = 2Ei+4E 2 + 3E 3 , 

we have a nontrivial linear combination equal to zero: 

2Ei+4E 2 + 3E 3 -X = 0. 


□ 

Figure 1.14 illustrates other examples of linear dependence and independence in 

R 3 . 


Theorem 1.9. Suppose Ui,...,U W are linearly independent in R n . Then each 
vector X in R n can be expressed uniquely as a linear combination of the U;.* 

X = c\XJi H- \-c n U n . (1.15) 



1.4 ^-dimensional vectors 


21 



Proof. According to Theorem 1.8 there is a nontrivial relation among the n+ 1 vec¬ 
tors X,Ui,...,U„: 

$oX + + • • • + o n \J n — 0. 

In this relation, a$ is not zero. If it were zero then the linear independence of the 
Vi would imply that all the at are zero. Divide by ao to get relation (1.15) with 

Ci — —ail CLq . 

If there were two different representations of X of the form (1.15), their difference 
would be a nontrivial linear relation among the U* contrary to their linear indepen¬ 
dence. □ 

Linear functions. Just as for two-dimensional vectors, a function £ that assigns a 
number to each vector in R n is called linear if for all U and V in R n 

(a) £(cV) = c£( U) for all numbers c, and 

(b) £(U+V) = £(U)+£(V). 

Combining properties (a) and (b) we deduce that a function £ from R n to R is linear 
if for all U and V in R n and numbers a and b , £ satisfies 

£(aV + b\) = a£<JJ) + b£(S). (1.16) 

Every function £ of U = (u\ , « 2 , •••, u n ) of the form 

^(U) = C\U\ +C2U2 H - 1 -c n u n , (1.17) 

where ci,C 2 ,... ,c n are numbers, has properties (a) and (b), and therefore is a linear 
function. (See Problem 1.45.) Conversely, we have the following theorem. 


















22 


1 Vectors and matrices 


Theorem 1.10. Let l be a linear function from R n to R. Then there are num¬ 
bers c\,...,c n such that 

t(JJ) = Cl Ml + C2U2 H- 

/or vector XJ = (u\,U 2 ,...,u n ) in R n . 


Proof Let E ; - be the vector in M/ 2 whose j-th coordinate is 1 and all other coordi¬ 
nates are zero, 


Ei = (1,0,0,0,... ,0) 

E 2 = (0,1,0,0,...,0) 
e 3 = (0,0,1,0,... ,0) 

E n = (0,0,0,0,..., 1). 

The vector U = (u \, U 2 ,..., u n ) can be expressed as the linear combination of the E ; -: 


U — M 1 E 1 + W 2 E 2 + • • • + u n E n 
Set cj = €(JLj), j = 1 ,... ,m. Since £ is linear, 

£(U) = ^i^(Ei) + M 2 ATC 2 ) h- 1 - 


= Cl Mi +C2M2 H-1 ~C n U n . 


□ 

As in the two-dimensional case, we call a function b of two vectors in R n bilinear 
if for every vector V, b( U, V) is a linear function of U, and for every vector U it is 
a linear function of V. 

Every function of the form ujVk is bilinear, and as in Theorem 1.7, so are linear 
combinations of them. The following result, an extension of Theorem 1.6, charac¬ 
terizes all bilinear functions. 


Theorem 1.11. Let b be a bilinear function of U = (mi,...,m„) and V = 
(vi,..., v n ). Then b is a linear combination of the functions fjk defined by 

fjk(U,Y) = fj k (ui,...,u n ,vi,...,v„) = UjV k , j= 1 k= 


Proof We fix the vector V and consider Z?(U, V) as a linear function of U. By The¬ 
orem 1.10 it has the form 
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&(U,V) = C\U\ + C2U2 + * * * + C n U n , 


(1.18) 


where c \, c 2 ,..., c n are functions of V. Since b is a bilinear function, the c L are linear 
functions of V. According to Theorem 1.10 each C( is a linear combination of the 
Vk. Setting this into formula (1.18) we get an expression of b(U, V) as a linear com¬ 


bination of UjVk , as asserted. 


□ 


Problems 

1.37. Let V = (vi,..., v n ), U = (u \,..., u n ), and W = (w\ ,..., w n ) be vectors in R n , 
and let c and d be numbers. Show that 

(a) V + W = W + V 

(b) (V + U) + W = V + (U + W) 

(c) c(U + V) = cU + cV 

(d) (c + d)U = cU + JU 

1.38. Express the vector (1,3,5) as a linear combination of the vectors 

Ui= (1,0,0), U 2 = (1,1,0), U 3 = (1,1,1). 

1.39. Show that every vector in R 3 is a linear combination of the vectors 

Ui = (1,0,0), U 2 = (1,1,0), U 3 = (1,1,1). 

1.40. Determine whether the vectors 

Ui = (1,0,0), U 2 = (1,1,0), U 3 = (1,1,1). 

are linearly independent. 

1.41. Show that the vectors 

( 1 , 1 , 1 , 1 ), ( 0 , 1 , 1 , 1 ), ( 0 , 0 , 1 , 1 ), ( 0 , 0 , 0 , 1 ) 
are linearly independent in R 4 . 

1.42. Are the vectors 


(1,2,1), (1,2,2), (1,2,3), (1,2,4) 


linearly dependent or independent? What theorem of this section is particularly 
applicable to them? 
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1.43. Let 


U = (3,7, 6 ,9,4), 
V = (2,7,0,1,-5). 


Show that the vector 



is a linear combination of U and V. 

1.44. The set of points (x,y,z) in R 3 satisfying 

-1<jc< 1, -1 < y < 1, -1 <z<1, 

is a solid cube. Write out the coordinates of the eight corner points of the cube. Is 
there a linear function that is equal to 8 on every corner? 

1.45. A function € from R n to R is defined by 

ATJ) = c\u,\+ C 2 U 2 h- 1 - c n u n 

where U = (u \, m 2 ,..., u n ) and c\,...,c n are numbers. 

(a) Show that €(cU) = c€(U) for all vectors U and numbers c. 

(b) Show that for all vectors U and V in R", €(U + V) = £(U) + £(V). 

1.46. Let U = (1,3,1) and V = (2,2,2). Express W = (3,5,3) as a linear combination 
of U and V. 

1.47. Show that the vectors (1,1,1), (1,2,3), and (3,2,1) are linearly dependent. 

1.48. Let P be the set of all the points (v,y,0,0), and let Q be the set of all points 
(0,0, z, w) in R 4 . P and Q are two-dimensional planes in R 4 . How many points do P 
and Q have in common? 

1.49. Let X = (x\,X 2 ,...,x n ) and Y = (yi,y2»---»3 ; n) i n Which functions are 
bilinear? Of the bilinear functions, which are symmetric: b(X,Y) = b( Y,X)? Which 
ones are antisymmetric: b(X, Y) = -Z?(Y,X)? 

(a) b(X,Y) = Xl y n 

(b) b(X, Y) = x\y n - x n y\ 



(d) b(X, Y) = x\y\ + v 2 y 2 + ■ • ■ + x n y n 

1.50. Let U = (u\,U 2 ,U 2 ,U 4 ), Y = (vi,v 2 ,V 3 ,V 4 ), and W = (wi,w 2 ,W 3 ,W 4 ). Which of 
the following functions / have the antisymmetry property 


/(U,V,W) = -/(V,U,W)? 


(a) /(U, V, W) = u\v\w\ 

(b) /(U, V, W) = U 1 W 3 -V 1 W 2 
(C) /(U, Y, W) = (U 2 V 3 - u 3 v 2 )w 4 




1.5 Norm and dot product in n dimensions 
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1.51. Let U = (u\ , U 2 ,u^) and V = (vi,V 2 ,V 3 ). Which bilinear functions b have the 
symmetry property 

fe(U,V) = fc(V,U)? 

(a) b(\J, V) = IOmiVi 

(b) fr(U,V) = U\V 2 ~U 2 V\ 

(c) b(\J, V) = U\V 2 + M 2 V 2 + IOW 3 V 1 

(d) b(\J, V) = W 1 V 3 + 10^2^2 + W 3 V 1 

1.52. Let U = (mi,W 2 »« 3 ) and V = (vi,V 2 ,V 3 ). Which bilinear functions b have the 
antisymmetry property 

fc(U,V) = -fe(V,U)? 

(a) b(V,\)=l0uivi 

(b) fr(U,V) = U\V 2 ~U 2 V\ 

(c) b(\J, V) = U\V 2 + U 2 V 2 + IOW 3 V 1 

(d) b(\J, V) = U 1 V 3 + 10^2^2 + U 3 V 1 


1.5 Norm and dot product in n dimensions 


We define now, in analogy with two-dimensional vectors, the norm of a vector in 
R”, and the dot product of two vectors in R n . 

Definition 1.12. The norm of a vector U = («i, z< 2 » • • • * u n ) is defined as 

HUH = a / M 2 + m 2 + ... + m 2 _ (1. 19) 


The zero vector 0 has norm zero, and conversely only the zero vector has norm 
zero. As in R 2 we think of the norm of U as the length of U or the distance from the 
origin to the point U. 


Definition 1.13. We define the dot product , U • V, of U = (u\,U2, .. 
V = as 

.,u n ) and 

U • V = U\V\ + U2V2 H - 1 - u n v n . 

(1.20) 


We ask you in Problem 1.53 to verify that the dot product is distributive and 
commutative: 


U • (V + W) = U • V + u • w, 


u v = V u. 
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The dot product of a vector with itself is its norm squared: 

u-u = nun 2 . 

Theorem 1.10 can be restated as follows: 

Every linear function £ from R n to R can be expressed as 

£(U) = C-U, 


where C is some vector in R”. 

Inequalities. The relation in R 2 

||U-V|| 2 = ||U|| 2 + ||V|| 2 - 2U • V (1.21) 

was derived using only the distributive and commutative rules for the dot product. 
Therefore it holds for vectors in R n as well. We show you in Problem 1.59 a different 
proof. 

Next we prove a very useful inequality that compares U • V to ||U|| ||V||. 


Theorem 1.12. The Cauchy-Schwarz inequality. Let U and V be vectors in 
R n . Then 


|U-V|<||U||HV||. 


( 1 . 22 ) 


Example 1.13. We have 


( 1 , 1 , 0 )* ( 0 , 1 , 1 ) = 1 , ( 1 , — 1 , 0 ) • ( 0 , 1 , 1 ) = - 1 , 


and in both cases the absolute value of the dot product is less than the product 
of norms, V2 V2. □ 

Proof. If U is the zero vector then both sides of the inequality are zero, so it holds 
in that case. Since the square is nonnegative 

0<||V-(U-V)U|| 2 . (1.23) 

Using (1.21) rewrite the right side of (1.23) to get 

0 < ||V|| 2 + ||(U • V)U|| 2 - 2(U • V) 2 

= ||V|| 2 + (U • V) 2 ||U|| 2 - 2(U • V) 2 . 

If U is a unit vector, ||U|| = 1 and we get 

0<||V|| 2 -(U-V) 2 

so (U • V) 2 < ||V|| 2 . Taking square roots we get that for unit vectors U, 



1.5 Norm and dot product in n dimensions 
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|U'V|<||V||. 

Next if U is a nonzero vector, is a unit vector, so 

1 


IIU|| 


-U V 


Multiply by ||U|| to get (1.22). □ 

We ask you in Problem 1.64 to find out when equality holds in the Cauchy- 
Schwarz inequality. An important consequence of the Cauchy-Schwarz inequality 
is the triangle inequality for vectors in R n . 

Theorem 1.13. Triangle inequality If U and V are vectors in R n then 

HU + V||<||U|| + ||V||. 


Proof We have seen in equation (1.21) that 

l|U-V|| 2 = ||U|| 2 + ||V|| 2 -2U-Y. 

Replace V by -V and use the Cauchy-Schwarz inequality to get 
IIU + VII 2 = ||U|| 2 + ||V|| 2 + 2U-Y 

<I|U|| 2 + ||V|| 2 + 2||U||||V|| = (||U|| + ||V||) 2 . 

Take the square root to get ||U + V|| < ||U|| + ||V|| as asserted. □ 

Using the Cauchy-Schwarz inequality we can define the angle between two 
nonzero vectors U and V in R n as follows. From the Cauchy-Schwarz inequality 
we see that for U and V nonzero 


-1 < 


U V 

iiuimvii 


< i. 


We define the angle 6 between U and V to be 


6 = cos 


U V \ 

mwwwr 


0 <6 < 71 , 


or cos 6 = 


U V 

MM 


Using this definition we can rewrite the formula (1.21) 


IIU-VII 2 = ||U|| 2 + ||V|| 2 — 2U • V 


||XJ — V|| 2 = ||U|| 2 + ||V|| 2 -2||U||||V||cos0. 


as 
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||U — V||, ||U||, and ||V|| are the lengths of sides of the triangle OUV in Figure 1.15. 
Thus (1.21) is the Law of Cosines in R n . 


Fig. 1.15 For U and V in 



Orthonormal set. We have used the set of coordinate vectors E ; in R n whose 7 -th 
component is 1 and all other components are zero. 

Ei = (1,0,0,0,... ,0) 

E 2 = (0,1,0,0,...,0) 

e 3 = ( 0 , 0 , 1 , 0 ,..., 0 ) 


E n = (0,0,0,0,..., 1) 

Every vector U = (u \, U 2 ,... ,u n ) in R n can be written as a linear combination of these 
vectors, 

U = u \Ei + w 2 E 2 + • • • + u n TL n . 

The list of vectors E ; - is called the standard basis in R n . 

Definition 1.14. A list of vectors Vi, V 2 , •.. ,V m in R n is called an orthonor¬ 
mal set when two properties hold: 

(a) The vectors Yj are pairwise orthogonal: 

Yj m Y k = 0 for all j±k. 

(a) Each vector Yj has norm 1: 

IIV 7 II - 1 for all j. 


The standard basis of R n is an orthonormal set. 
Example 1.14. The vectors 

Qi = (1,1,1,1), Q 2 = (1,1,-1,-1), 


Q 3 = (1,-1,1,-1), Q 4 = (-1,1,1,-!) 
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are pairwise orthogonal. For example, Q 2 -Q 3 = l-1-1 + 1=0. Each has 
norm 2 . For example, 11 Q 411 = Vl + 1 + 1 + 1 = 2 . Dividing by the norms there¬ 
fore gives unit vectors V 7 = ^Q ; -; 


Vi=(iiii), v 2 = (i i-i-i), 


\ 7 _(l 11 \\ \J _ ( 111 \\ 

▼3 —V9’ 2’2’ 2/’ ^4 — V 2’2’2’ 2/ 


are an orthonormal set in ] 


□ 


We show now that there are many more orthonormal sets. The basic result is the 
following. 


Theorem 1.14. Let n> 2 and k<n. Let Wi, W 2 ,..., be vectors in R”. Then 
there is a nonzero vector V orthogonal to each of the vectors W i, / = 1,2,..., /:. 


Proof We argue by induction on n. The case n- 2 is simple. If Wi is the zero vector 
we may take any vector for V. If Wi = ( a,b ) ^ 0 we take V = ( -b,a ). Suppose 
inductively that the theorem holds for n - 1, where n > 3. The desired relations 
W 7 • V = 0, 7 = 1 , 2 ,...,/:, form a system of k linear equations for the n components 
vi,V2,...,v„ of V: 


wj\V\ +Wj2V2 4- \-Wj n v n = 0, 7 = 1,2,...,/:. (1-24) 

We look at the last terms on the left, Wj n v n . If all the numbers Wj n , j = 1,2, ...,/: are 
zero, then we can satisfy these equations by choosing vi,V2,...,v w _i equal to zero 
and v n equal to 1. If one of the numbers wj n is nonzero, we use the 7 -th equation 
in (1.24) to express v n as a linear combination of vi, V2,..., v n -\ . In case k = 1 there 
is no further equation and vi,V2,...,v w _i may be chosen arbitrarily; otherwise set 
the expression for v n into the remaining equations (1.24) we get a system of k- 1 
equations of form (1.24) for the n— 1 unknowns vi,V2,.--,v w _i. By the induction 
hypothesis this system has a nonzero solution. □ 

We use Theorem 1.14 to construct many orthonormal sets Vi, V 2 ,...,\ n in R n : 
Choose Vi as any vector of norm 1. According to Theorem 1.14 there is a nonzero 
vector, call it V2, orthogonal to Vi. Again using Theorem 1.14 with k- 2, there is 
a nonzero vector V3 that is orthogonal to both Vi and V2. Proceeding in this way 
we construct a set of n nonzero vectors Vy that are pairwise orthogonal. Then we 
multiply each vector with a suitable number to make a vector of norm 1 . 


Theorem 1.15. If vectors Vi,..., V m are an orthonormal set in then they 
are linearly independent. 


Proof Suppose there is a linear relation 
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Cl V !+••• + c m Y m — 0. 

Take the dot product with V*; since V/ • Yj = 0 when j =£ /, and V/ • V/ = 1, we get 
Ci = 0. Since this holds for each i this proves the linear independence. □ 

By Theorem 1.9 every vector X in R n can be expressed as a linear combination 
of the linearly independent vectors V i ,..., Y n 

X = CiVi H-h c n Y n . 


To find the cj when the Yj are orthonormal, dot both sides of the equation with Vi 
to see that 

V 1 -X = V 1 -(ciV 1 +--- + C „V„) = Cl ||V 1 || 2 + C2 V 1 -V 2 + --- = c 1 . 

Similarly each cj = Yj • X. Thus we have proved the following theorem. 

Theorem 1.16. Let Vi,... , Y n be an orthonormal set of vectors in R n . Then 
every vector X in R n can be expressed as the linear combination 

X = (Vi • X)Vi + • • • + (V„ • X)V W . (1.25) 


Example 1.15. To write X = (1,2,3,4) as a linear combination of the set of 
orthonormal vectors Vi, V 2 , V 3 , V 4 in Example 1.14, we find 

c 1 =V 1 -X=i(l)+|(2)+i(3)+i(4) = 5 
c 2 = V 2 • X = -2 
C 3 =V 3 -X = -1 
c 4 = V 4 • X = 0 


and so 



□ 


Problems 

1.53. Denote the dot product function b(\J, V) = U • V. Verify that the dot product 
is distributive and commutative and show that it is a symmetric bilinear function by 
verifying the following properties: 

(a) b( U, V) is bilinear 

(b) fe(U,V) = fe(V,U) 
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1.54. Let U, V, W, and X be the points on the rectangular box in R 3 shown in 
Figure 1.16. Find the following distances: 

(a) HV-UH, 

(b) the distance between U and the origin, 

(c) the distance between U and V + W, 

(d) ||W-X||. 



Fig. 1.16 The points used in Problem 1.54. 


1.55. Find a vector W in R 5 orthogonal to all three of the vectors 

(1,2,0,0, -2), (-2,1,2,0,0), (0, -2,1,0,2). 

1.56. Which of the following vectors are unit vectors? 

(a) 5 ^( 3 ,4,5), 

(b) -U if U is, 

(c) if U = (Wi,W 2 , W 3 , W4, W5, U^) is. 

(d) i(i,-V 2 , V5.-V3). 

1.57. Determine whether the following pairs of vectors are orthogonal to each other. 

(a) ( 1 , 1 , 1 , 1 , 1 ) and (- 1 ,- 1 ,- 1 ,- 1 ,- 1 ), 

(b) (1,1,1,1) and (-1,-1,-1,3), 

(c) ( 1 , 1 , 1 ) and (- 1 , 2 ,- 1 ). 

1.58. Show that if X and Y are orthogonal to each other in R n then 

l|X + Y|| 2 = ||X|| 2 + ||Y|| 2 . 

This is sometimes called the Pythagorean theorem in R n . 

1.59. Let u and v be numbers. Use the algebraic identity 

(u - v ) 2 = u 2 - 2 uv + v 2 

n times to prove that for all vectors U and V in R n , 

IIU - VII 2 = ||U|| 2 + ||V|| 2 -2U-V. 
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1.60. Find the maximum norm of X = (x\ , X 2 , ..., xioo) in the unit cube in R 100 , 

0 <*i<l, 0 <X 2 < 1 , ... 0 <xioo<l. 

1.61. Imagine an ^-dimensional cube in R n with edge length c> 0, consisting of all 
the points X = (x \, X 2 ,..., x n ) with 


0 < Xk < c, k = 

(a) Find the point P in the cube that has the largest norm. Call P the far comer of the 
cube. 

(b) For what value of c is the far corner P on the unit sphere of R n , ||X|| = 1? 

(c) Keeping the far corner point on the unit sphere, what happens to the edge length 
c as the dimension n tends to infinity? 

1.62. Let C = (c\ ,..., c n ) be a given vector in R^ and let X = (x \,..., x n ). Show that 
the function 

/(X) = *!+ C-X 

is linear, by finding D = ( d \,..., d n ) to express / in the form /(X) = D • X. 

1.63. Let Wi = (1,1,1,0) and W 2 = (0,1,1,1). Find two linearly independent vectors 
that are orthogonal to both W 1 and W 2 . 

1.64. Our proof of the Cauchy-Schwarz inequality, Theorem 1.12, used that when 
U is a unit vector, 

0 < ||V - (U • V)U || 2 = ||V || 2 - (U • V) 2 . 

(a) Show that if U is a unit vector and |U • V| = ||U|| ||V||, then V = (U • V)U. 

(b) Show that equality occurs in the Cauchy-Schwarz inequality for two arbitrary 
vectors V and W only if one of the vectors is a multiple of the other vector. 



Fig. 1.17 The icosahedron in a cube, for Problem 1.65. 
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1.65. The regular icosahedron fits nicely into a cube, as shown in Figure 1.17. It has 
twenty equilateral triangle faces. In the cube, 0 < x < 2, 0 < y < 2, and 0 < z < 2. 
Points A and B are located on the face of the cube where x = 2, and are equally 
spaced from the center of that face, so A = (2,1 - h, 1) and B = (2,1 + h, 1) for some 
number h > 0. 

(a) Find the coordinates of points C and D in terms of h. 

(b) Express the distance between A and B in terms of h. 

(c) Express the distance between A and D in terms of h. 

(d) Find h. 

1.66. Let 

Vi = (tf,fc,...,fc), V 2 = (b,a,b,...,b), V 3 = (b,b,a,...,b), ... V„ = (&,...,M) 

be n vectors in R n , where n > 1. Find numbers a and b so that Vi,...,V n is an 
orthonormal set. 


1.67. Use the triangle inequality as needed to prove the following inequalities, 
where a and b are numbers and X and Y are vectors in R n . 


(a) 

(b) 

(c) 

(d) 


\a\ < \a — b\ + \b\ 
a\ - \b\ <\a — b\ 
\a\-\b\\<\a-b\ 

||X|| - ||Y||| < ||X — Y|| 


1.6 The determinant 


The determinant of order n is a number valued function of an ordered list of n vectors 
Vi,V 2 ,...,V„ each with n components. We denote it as 

det(Vi, V 2 , • •V«). 

Before giving the formula for the determinant we list its algebraic properties: 

(i) det(Vi, V 2 ,..., V w ) is a multilinear function, that is, a linear function of each 
\i when all other Vy, j ± i are held fixed. 

(ii) If two vectors V/ and \j of the ordered list are equal, the value of the deter¬ 
minant is zero: 

det(...V, ...,V,...) = 0. 

(iii) det(Ei,E 2 ,... ,E W ) = 1, where E* is the vector whose z'-th component is 1 and 
all other components are zero. 

We deduce three further properties from the properties above: 

(iv) If two of the vectors in the ordered list are interchanged, the value of the 
determinant is multiplied by -1. 
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Proof. Using repeatedly properties (i) and (ii) of the determinant we get the follow¬ 
ing sequence of identities, where we only indicate the i -th and j-th vectors U and V 
of the ordered list: 


0 = det(U + V, U + V) = det(U, U) + det(U, V) + det(V, U) + det(V, V). 


Therefore 


0 = det(U, V) + det(V, U). 


This proves property (iv). □ 

(v) IfVi,V 2 ,...,V w are linearly dependent, then det(V \ ,V 2 ,...,\ n ) = 0. 


Proof If the V/ are linearly dependent, one of the V/, say Vi, is a linear combination 
of the others: 

n 

v. = 2>/V/ 

i-2 

Using the multilinear property of the determinant we have 

( n \ n 

2 mjdctCVj, V 2 ,..., V„). 

i=2 J i-2 

In each term of the sum on the right side of the last equation two of the vectors in 
det are equal. Therefore by property (ii) each term of the sum is zero. It follows that 
the whole sum is zero. This proves property (v). □ 


Next we show the converse of property (v). 

(vi) If the vectors Vi, Y 2 ,..., \ n are linearly independent, det(V 1 , V 2 ,...,\ n ) is 
not zero. 


Proof Since the vectors Vi, V 2 ,..., \ n are linearly independent, according to The¬ 
orem 1.9 we can express every vector as a linear combination of them. In particular 
the unit coordinate vectors E/ can be expressed: 

E .- = 2>V;. 

j= 1 


Since the determinant is a multilinear function, we write 

( n \ n 


det(Ei, E 2 ,..., E n ) = det 


bji Vj, E 2 ,..., E„ = Yj bjidetCVj, E 2 ,..., E„) 


n f n \ n n 

= yVjidet V > £ft 42 V b E 3 ,...,Ei I = fjbji f^b k 2det(V j,\ k , E3, ...,E«) 

7=1 V k-\ / 7=1 k- 1 


d-26) 
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Continuing in this fashion we get an expression of det(Ei,E 2 ,...,E n ) as a linear 
combination of the determinants det(V 7 -,V^,... ,V Z ) where Vy, V&,..., V z is a per¬ 
mutation of V i, V 2 , • • •, V n and therefore 

det(Ei,E 2 ,...,E„)= ^ (bj l b k2 ---b zn )det(Vj,V k ,...,\ z ). 

permutations jk—z 

But for each permutation, either 

det(V j, \ k ,..., V z ) = det(V!, V 2 ,..., V„) 


or 

det(V j, V*. \ z ) = -det(V!, V 2 ,..., V„) 

Therefore det(Ei,E 2 ,...,E n ) is a multiple of det(Vi, V 2 ,..., V„). Since 


det(Ei,E 2 ,... ,E n ) = 1 


it follows that det(Vi, V 2 ,..., \ n ) is not zero. This proves property (vi). □ 

We shall show that properties (i), (ii), and (iii) completely characterize the deter¬ 
minant. First we derive from these three properties a formula that holds for every 
function satisfying the properties. Then we show that a function defined by this 
formula satisfies the three properties. 

To use properties (i), (ii), and (iii) of det to devise a formula for the determinant, 
we take an ordered list of n vectors V i, V 2 ,..., \ n in R n . Denote the k- th component 
of \j by vjcj, (j = 1,2,...,ft; k- 1,2,...,ft). We write each vector \j as a linear 
combination of the unit vectors E^: 

n 

\j = Y J n j E k . (1.27) 

k= 1 


Using the above expression for Vi we write 


det(Vi,V 2 ,...,V n ) = det 


^V£iE £,V 2 ,...,V w 


\k= 1 




Using multilinearity of the determinant we can rewrite the right side as 


n 

£v* 1 det(E*,V 2 ,...,V l ,). 

k= 1 


Next we use the expression (1.27) for V 2 in each term of the sum above, and multi¬ 
linearity to rewrite each term of this sum. We get the double sum 


det(V! ,V 2 ,...,V„)=yy v ki vndet(E k , E e , V 3 ,..., V„). 

k= 1 €=l 
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Proceeding in this manner we get the formula as an n-tuple sum 

n 

det(Vi,V 2 ,...,V„)= ^ (vkiv( 2 ---v zn ) det(E*,E f ,...,E z ). (1.28) 

k,z= 1 

Next we use properties (i), (ii), and (iii) to determine det(E^,E^,... ,E Z ). 

According to property (ii) the determinant is zero when two of the vectors in the 
list are equal. This shows that in formula (1.28) we can restrict the summation to the 
case where k, l, ... , z is a permutation of 1, 2 ,..., n. 

Next we define the signature of a permutation as follows. Denote by 

P = P\P2"'Pn 

a permutation of 1,2,..., n. That is, 

p(l) = p u p( 2 ) = p 2 , ... pin) = p n . 

Form the two products 

Y](x Pl -x Pj ) (1.29) 

i<j 

and 

~[(xi-Xj) (1.30) 

i<j 

Each factor in the product (1.29) is equal to one of the factors in the product (1.30) 
or its negative. Therefore the two products are equal or are the negatives of each 
other. 


Definition 1.15. We define the signature of a permutation p of 1, 2,..., n to be 
the number s(p) = 1 or -1, such that 

r>„ - x Pj ) = S(p ) [> - Xj). (1.31) 

i<j i<j 


Signature s(p) has following properties: 

(a) The signature of an interchange is -1. 

(b) The signature of the composite of two permutations is the product of their signa¬ 
tures: 

s(pq) = s(p)s(q). 

Proof, (a) When we interchange Xk and x m , (k < m), for every l between k and m 
both xt - Xk and x m - X£ change sign, an even number of sign changes. In addition 
x m - Xk changes sign, an odd total number of sign changes. 

Property (b) follows directly from the definition (1.31), as we ask you to show in 
Problem 1.75. □ 



1.6 The determinant 


37 


Every permutation p can be obtained as a composite of interchanges. Let c(p ) be 
the number of interchanges. If follows from properties (a) and (b) that 

s(p) = (-lf p> . 

Example 1.16. Let p- 312 be a permutation of 1,2,3, that is, 

P\ = 3, /?2 = 1, P3=2. 

Two interchanges bring 312 into 123: one interchange takes 312 to 132, and a 
second interchange takes 132 to 123. Therefore s(p) = 1. □ 

Example 1.17. One interchange takes 15342 to 12345, so s(15342) = -1. □ 

According to property (iv) a factor of -1 is introduced with each interchange of 
vectors, and using (iii) we get 

det(E,„ ,...,E Pn ) = (-1 ) c( ^ det(Ei,... ,E„) = (-l) c ^ = s(p). 

Set this result into formula (1.28) for the determinant. We have shown that if a 
function satisfies the three properties (i), (ii), and (iii) then the value it assigns to 
(Vi,...,V„) is 

YjS(p)v pil V p2 2---V Pnn 

P 

summed over all permutations p = p\P2'"Pn of 1,2,..., n. Therefore we make the 
following definition. 

Definition 1.16. Let Vi,... ,V n be vectors in R n written as columns, 



'vn" 


Vl2 


'Vln' 

Vi = 

V21 

ii 

<N 

> 

V22 

v„ = 

V2 n 


.v„i. 


_V^2_ 


. Vnn . 


The determinant is defined as 

det(Vi,V 2 ,...,Vn) = E s( P') v p\ I V P2 2 ■ ■ • v p n n (1-32) 

p 

where the sum is over all permutations p = pip 2 ---p n rf\, 2 ,...,n. 


The 2 x 2 case. Lor n = 2 let p = 12 and q = 21 be the permutations of two numbers 
1,2. No interchanges are needed for p , so the signature s(p ) = 1. One interchange 
takes 21 to 12 so s(q) = -1. Lor 


Vi 


vn 

V21 


V 2 = 


V12 

V22 
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the determinant is 


det(V i, V 2 ) = s(p)v Pl i v P2 2 + s(q)v qx \ v qi2 
= s( 1 2 )V\_\ V 22 + s( 1V)V2\V\2 
= V11V22 ~V 2 \V\2 


where the sum is over the two permutations of 1 , 2 . 

Example 1.18. det([_ 2 2 ],[f]) = (2)(l)-(-2)(5) = 12. □ 

We verify now that the determinant as defined by formula (1.32) has the three 
algebraic properties listed at the beginning of this section: 

(i) Each term in the sum on the right in (1.32) is a multilinear function of the Yj. 
Therefore so is their sum. 

(ii) In definition (1.32) interchange V/ and Yj where i < j. We get 

det(. ..,\j,...,y i ,...) = Y J s(p)(- ■ • v Pj i ■ ■ ■ v Pij ■■■). (1.33) 

P 

Let r be the interchange of i and j , and denote the permutation pr as q. Since the 
factors v Pj i and v Pi j can be interchanged with no effect, we can rewrite (1.33) as a 
sum over all permutations q\ 

det(..., Yj, • • •, V;,...) = s(p)(Vq l \Vq 2 2 • • • Vq n n ) (1.34) 

q 

According to the product formula s(q) = s(pr) = s(p)s(r). The signature s(r) of an 
interchange r is -1. Therefore s(q) - -s(p). Set this in formula (1.34); we get 

det(..., V/,..., V;,...) = -det(..., V*-,..., V 7 -,...). (1.35) 

In words: if two vectors in the list are interchanged, the value of det is multi¬ 
plied by -1. It follows from (1.35) that if two of the vectors V; and Yj are equal, 
det(Vi,...,V„) = 0 . 

(iii) For Yj = E ; -, (j = 1,2,... ,n), the sum on the right side of formula (1.32) for 
the determinant has only one nonzero term, corresponding to the trivial permutation 
12 ••• n, and that term is equal to 1. This shows that 

det(Ei,E 2 ,...,E„) = 1. 

This completes the proof that the determinant defined by formula (1.32) has the 
three properties listed at the beginning of this section. 

The components of n column vectors in R n form a rectangular array of numbers 
called a matrix. (See also Section 1.8.) We write 


M = [V! V 2 - V*], 


detM = det(Vi,V 2 ,...,VJ. 
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We use Definition 1.16 to compute the determinant of M. For example, using the 
vectors in Example 1.18, 


det 


2 5 
-2 1 


- (2)(1) - (—2)(5) = 12. 


The 3x3 case. Let M = 


m ii m\2 vn\2 
ni2\ rri22 m3 
mi mi mi 


. For n- 3 there are six permutations, 


123, 132, 213, 231, 312, 321. 


Using (1.32) 

detM = s(l23)mi\ni22m3 + 13 2)m 1^32^23 + s(213)m2imi2m33 

+ ^(231)^21^32^13 + s(312)m3imi2m23 + s(321)m3im22^i3 

= m\\ni22m3 ~m\m32ni23 -m2\m\2m33 

+ m 2 \m 32 m \3 -\-m 31 m 12 m 23 - m 3 \m 22 M\ 3 . (1.36) 

This sum can be expressed in several ways; for example we can factor out the num¬ 
bers of the first column, mj\ , giving 

detM = m\\(ni22m3 -ni32m3) 

-m2i{mi2m33-m32mi3) + m3i(m\2rri23-m22rni3). (1.37) 

If we factor out the numbers of the first row, m\k, we get 

detM = mii(m 2 2m33 -m32m23) + mi2(-m2im 3 3 +m3im23) + mi3(m2im 3 2-m3im22) 
and this is equal to 


= mu det 


m 2 2 m 3 

m32 rri33 


-m^det 


m 1 m 2 3 

m\ m3 


+ mi3det 


mi mi 
mi mi 


Example 1.19. 


det 


0 1 2 
5 67 
-13 2 


= 0 - ldet 


5 7 
-1 2 


+ 2det 


5 6 
-1 3 


= 0 - (10 + 7) + 2(15 + 6) = 25. 


□ 


M = 


0 0 0 d 
OHO 
a 0 0 0 
OOcO. 


Example 1.20. Let 
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The determinant must be either abed or - abed , since the only nonzero term in 
detM is the one containing m 3 im 22 ^ 43 ^i 4 = abed. In Problem 1.72 you find 
that the signature of 3241 is +1. Therefore 

detM = abed. 


□ 

Determinant and orientation. We turn now to the geometric meaning of the deter¬ 
minant. We use the notion of determinant to define algebraically the orientation of 
an ordered list of n linearly independent vectors in R n . 


Definition 1.17. Let Vi,V 2 ,...,V n be an ordered list of n linearly inde¬ 
pendent vectors in R n . The ordered list is called positively oriented if 
det(Vi, V 2 ,..., y n ) is positive, and negatively oriented if det(V 1 , V 2 ,..., \ n ) 
is negative. By property (vi) det(Vi, V 2 ,..., Y n ) ± 0. 


Note that by property (iii) of the determinant det(Ei, E 2 ,..., E n ) = 1 and by prop¬ 
erty (i) det is multilinear. Therefore 

(a) the ordered list of vectors Ei,E 2 ,...,E„ is positively oriented. 

(b) the ordered list of vectors Ei, E 2 ,..., E n _i, -E n is negatively oriented. 

See Figures 1.18 and 1.19. 



Fig. 1.18 Ei,E 2 ,E 3 are positively oriented in R 3 . 


An ft-tuple U(0 = ( u\(t ), u^it ),..., u n {t)) of continuous functions Ui from an inter¬ 
val in R to R is called a continuous vector function in R n . A deformation of an 
ordered list of linearly independent vectors is an ordered list of continuous vec¬ 
tor functions Vi (t),\ 2 (t),...,\ n (t) in R n such that for every t the vectors are lin¬ 
early independent. The determinant of a deformation is nonzero for every t. Since 
det (Vi(0,..., V«(0) is a sum of products of the component functions and each com¬ 
ponent of Y j(t) is a continuous function of t , det(Vi(f),.. •, V w (0) is continuous. It 
follows from the Intermediate Value Theorem that the continuous function of one 
variable, 


det(Vi(0,V2(0,...,V„(0)^0, 
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has the same sign for all t. This shows that the ordered lists 
have the same orientation for all t. 



Fig. 1.19 Left: The ordered list Vi,V 2 is negatively oriented. Right: The ordered list Vi,V 2 is 
positively oriented. 


The following is the basic result about the geometric meaning of orientation. 

Theorem 1 . 11 , (a) Every positively oriented ordered list of n linearly 
independent vectors Vi, V 2 can be deformed into the ordered 

list of unit vectors Ei, E 2 ,..., E n . 

(b) Every negatively oriented ordered list of vectors in W 1 can be deformed 
into the ordered list Ei, E 2 ,..., -E n . 


Proof. We outline a proof by induction on n. See Figure 1.20. 

(a) Suppose the vectors Vi,..., V w are positively oriented. There is a rotation that 
takes the vector \ n into pE n , p a positive number. This rotation carries the 
vectors Yj into vectors we denote as W f (j = 1,2, ...,n-l). We follow this by 
shrinking the n -th component of each of the vectors W ; -, j < n, to zero. This 
amounts to adding a multiple of the n-th vector W n = pE n to each W ; -, j < n , 
which does not change the determinant of the resulting set of vectors. Therefore 
the resulting vectors in the hyperplane x n = 0 are linearly independent. 

Denote by Z j, j < n , the (n - l)-dimensional vectors obtained by omitting the 
n -th component of W ; -. Since the omitted components are zero, these are a set 
of (n—Y) linearly independent vectors with (n- 1) components. The determinant 
of the n vectors in their new position is equal to p times the (n— 1) by (n- 1) 
determinant of the vectors Zi,Z 2 ,...,Z w _i. Since the determinant is nonzero 
under a deformation, its sign doesn’t change. Since p is positive, it follows that 
det(Wi,W 2 ,...,W n ) is positive. By the induction hypothesis this list of (n- 1) 
positively ordered vectors with (n - 1) coordinates can be deformed into the 
ordered list Ei,E 2 ,...,E n _i. In one dimension a positively oriented vector is a 
positive number, which can be deformed into the number 1. 

This completes the outline of the proof that the positively oriented list of vectors 
Vi,V 2 ,...,V„ can be deformed into the list of vectors Ei, E 2 ,..., E n . 
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(b) Analogously one shows that if the ordered list Vi, V 2 ,..., V« negatively oriented, 
it can be deformed into the ordered list Ei, E2,..., -E„. 


□ 




Fig. 1.20 Notation in the proof of Theorem 1.17 for n = 3. 


Problems 


1.68. Evaluate the determinants 
1 O' 


(a) det 

(b) det 

(c) det 

(d) det 

(e) det 


0 1 
1 0 
0 -1 
1 2 
0 -1 
1 4 
1 4 

U V], where the columns U and V are linearly dependent in ] 


1.69. Show det[U V] is a bilinear function of pairs of column vectors U, V in R 2 by 
verifying that: 

(a) det[U + W V] = det[U V] + det[W V] and det[U V + W] = det[U V] + det[U W], 

(b) det[cU V] = cdet[U V] and det[U cV] = cdet[U V]. 


1.70. Use bilinearity of the determinant function to show that each expression is 
zero. 


(a) det 


5 a b 
5c d 


-5det 


a b 
c d 














1.6 The determinant 


43 


X z 

V 0 

1 . 71 . Evaluate the determinants. 




1 

0 

O' 

(a) 

det 

0 

1 

0 



0 

0 

1 



1 

0 

0 

(b) 

det 

0 

1 

0 



0 

0 

-1 



1 

0 

(c) 

det 

0 

- 

1 



0 

0 - 



1 

0 

o' 

(d) 

det 

0 

0 

2 



0 

3 

0 



0 

0 

3 

(e) 

det 

0 

2 

0 



1 

0 

0 


(b) det 


x y-z 

V w 


-det 


x y 

V w 


+ det 


1 . 72 . Find the signature £(3241) from the equation 


C*3 - X 2 )(x 3 - M )(*3 - X\)(x 2 - X 4 )(x 2 ~ X\){x 4 ~ X\) 


= £(3241)(*i - x 2 )(xi - x 3 )(xi - x 4 )(x 2 - x 3 )(x 2 - x 4 )(x 3 - x 4 ). 


1 . 73 . In the permutation 3241 there is an even number of cases where a larger num¬ 
ber is to the left of a smaller one, 41, 21, 31, 32, and the signature is +1. (See 
Problem 1.72.) 

(a) Prove in general that the signature of a permutation is +1 if there is an even 
number of such cases and is -1 if there is an odd number. 

(b) Find ^(1237456). 

(c) Find s(1273456). 


1 . 74 . Evaluate the determinants. 


(a) det 


(b) det 


100 ••• O' 
020 ■■■0 
0 0 3 ---0 


.000 ■■■ wJ 
n 1 1 -l 

0 / 2-1 1 ••• 1 
0 0 n -2 ••• 1 


0 0 0 3 1 

0 0 0 ---2 
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1.75. When p and q are permutations of l,2,3,...,n we write the composition q 
followed by p as pq. Show that the signature of permutations has the property 
s(pq) = s(p)s(q). 

1.76. Use the result of Problem 1.75 to show that a permutation and its inverse have 
the same signature. 

1.77. Show that the orientation of the ordered list of vectors Ei, E 3 , E 2 in R 3 , and 
the signature s( 132), are both negative. 

1.78. Show that the orientation of the ordered list of vectors E 3 , Ei, E 2 in R 3 , and 
the signature s( 312), are both positive. 


1.7 Signed volume 

We start by defining a simplex. 

Definition 1.18. Let k <n and let Vi,..., V& be linearly independent vectors 
in R n . A k-simplex with vertices 0, V 1 ,..., in R n is the set of all points 


k 



X = ciVi+csV 2 + - + c*V* where 


We say the simplex is ordered if the order of the vertices is specified. 

Example 1.21. The 2-simplex with vertices (0,0),(1,0),(0,1) is the triangular 
region of R 2 drawn in the left side of Figure 1.21. The 3-simplex with vertices 
(0,0,0), (1,0,0), (0,1,0), (0,0,1) is the solid tetrahedron in R 3 shown in the 
center of the figure. The 2-simplex with vertices (0,0,0), (0,1,0), (0,0,1) is 
the triangular surface in R 3 shown at the right of the figure. □ 



Fig. 1.21 Left: A 2-simplex in R 2 . Center: A 3-simplex in R 3 . Right: A 2-simplex in R 3 . See 
Example 1.21. 

For each j = 1 ,...,& the j-face of the ^-simplex is the ordered (k - 1)- 
simplex whose vertices are the vectors 0, Vi,..., V& omitting \j. 
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Example 1.22. The 1-face of the ordered 3-simplex with vertices 
(0,0,0), (1,0,0), (0,1,0), (0,0,1) 
in Example 1.21 is the ordered 2-simplex with vertices 
(0,0,0), (0,1,0), (0,0,1). 


See Figure 1.21. □ 

We define the distance from a point V in R n to the hyperplane spanned by n - 1 
linearly independent vectors as follows: By Theorem 1.14 there is a unit vector N 
that is orthogonal to the hyperplane. Define the distance from V to the hyperplane 
to be |V -N|. 

The volume of an ^-simplex is defined to be the product of the (n - l)-dimensional 
volume of the y'-face times the distance from Vy to the hyperplane spanned by the 
Yi,(i ± j ) that contains the y-face, divided by n. As we shall see, this number is 
the same for all y. To complete this definition we need the volume of the face. In 
the case of a 3-simplex in R 3 , the face is a triangular region that lies in a two- 
dimensional plane. The volume of the face is then the area of that triangle. In higher 
dimensions it is possible to introduce a coordinate system with n - 1 coordinates 
in the hyperplane containing the face, so that the volume of a face can be defined 
inductively by dimension. 


Definition 1.19. The signed volume , denoted S(Y\,...,Y n ), of the ordered 
^-simplex with vertices 0, Vi ,..., Y n is defined as the ^-dimensional volume 
of the simplex in case the ordered collection Vi ,..., V n is positively oriented, 
and as minus the volume in case the ordered collection Vi,..., V n is negatively 
oriented, 


We shall derive a formula for the signed volume and show that it is a multilinear 
function of Vi,..., Y n , 

Let Vi,..., V n be an ordered set of linearly independent vectors. We define the 
signed distance s(Vy) of a vector Yj from the hyperplane spanned by the vectors 
Vi,...,Vi,...,V n , (i ± j ) as follows: 

(a) s(Y j) is the distance of Vy from the hyperplane spanned by the vectors V/, (i ± j ), 
in case the ^-simplex S(Y i,..., Y n ) is positively oriented. 

(b) s(Y j) is - 1 times the distance of Vy from the hyperplane spanned by the vectors 
V/, (i ± y), in case the ^-simplex S(Y i,..., Y n ) is negatively oriented. 

(c) In case the vectors V; are linearly dependent, we define s(Vy) as zero. 

In Problem 1.83 we ask you to verify that the function s(V) is a linear function 
in each half-space outside the hyperplane. Let W be the reflection of V across the 
hyperplane. It follows from the definition above of signed distance that s( W) = 
-s(Y). This shows that s(Y) is a linear function in the whole space. 
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The signed volume of the ^-simplex is equal to the product of ^(V w ) times the 
(n- l)-dimensional volume of the face with vertices 0,Vi,V2,...,V n -i. By the 
induction hypothesis the (n - l)-volume of this face is a multilinear function of the 
variables Vj, (i < n). This proves that the signed volume 5 (Vi,V2 ,...,V w ) is a 
multilinear function. That is property (i) of the determinant as defined in Section 1.6. 

If two of the Yj are the same, the volume is zero. That is property (ii) of the 
determinant. 

The volume of the simplex whose vertices are the unit vectors Ei,E 2 ,...,E n is 
. This shows that 

n!5(V 1 ,V 2 ,...,V n ) 

has the property (iii) of the determinant. We have seen that the three properties 
characterize the determinant. Therefore it follows that n!S(Vi,V 2 ,...,V n ) is the 
determinant. Dividing by n\ we get the formula for the signed volume of an ordered 
simplex: 

<S , (Vi,V 2 ,...,Vn) = —7 det(V i, V 2 ,..., V„). 
n\ 



Fig. 1.22 The tetrahedron in Example 1.23. 


Example 1.23. The tetrahedron with ordered vertices (0,0,0), (1,1,3), (2,4,1), 
(5,2,2) (see Figure 1.22) has signed volume 


^jdet 


1 25 
1 42 
3 1 2 


= i (1(4(2) -1(2)) - 2(1(2) - 3(2)) + 5(1(1) - 3(4))) 


_ 6 + 8-55 _ 41 

6 “ 6 ' 


□ 

The parallelopiped in R n determined by vectors Vi,..., Y n is the set of points 


ciVi h-1 - c n Y n , 


0<Ci< 1. 
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The signed volume of the parallelopiped determined by Vi ,...,\ n is ft! times the 
signed volume of the ft-simplex and is equal to 

det(Vi,..., V w ). 

Example 1.24. The signed volume of the parallelopiped determined by ver¬ 
tices (1,1,3), (2,4,1), (5,2,2) is 


det 


1 2 5 
1 4 2 
3 1 2 


-41. 


See Example 1.23 and Figure 1.23. 


□ 



Problems 

1.79. Justify the following items to prove that the area of the triangle with vertices 
0, U = {u\ , U 2 ) and V = (vi,V 2 ) is ^|ftiV 2 - U 2 V\\. (See Figure 1.24.) 

(a) The area of the triangle is ^ ||U||(||V|| sin#). 

(b) sin# = V 1- ( ilUTiWil)”- 

(c) The area is equal to ± VllU|| 2 ||V|| 2 - (U • V) 2 . 

(d) The expression ||U|| 2 ||V|| 2 - (U • V) 2 is equal to the perfect square (u\ V 2 - U 2 V 1 ) 2 . 
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Fig. 1.24 The triangle in Problem 1.79. 


1.80. Show that the signed area of the ordered triangle with vertices 0, U = (u\,U 2 ) 
and V = (vi,V 2 ) is ^(u\V 2 - U 2 V\). (See Figure 1.24 and Problem 1.79.) 

1.81. Find the area of the parallelogram with vertices (0,0), (1,3), (2,1), and (3,4). 
(See Problem 1.79.) 

1.82. Draw the ordered tetrahedra (ordered simplices in R 3 ) with the following ver¬ 
tices and find their signed volume. 

(a) 0, U = (2,1,0), V = (1,2,0) and W = (0,0,7). 

(b) 0, U = (2,1,0), V = (1,2,0) and W = (7,7,7). 

(c) 0, U = (2,1,0), W = (7,7,7) and V = (1,2,0). 



Fig. 1.25 Signed distance s(V) for positively oriented vectors Vi, V. See Problem 1.83. 


1.83. Let Vi ^ 0, V and W be vectors in R 2 with Vi,V and Vi, W positively ori¬ 
ented. See Figure 1.25. Let s(Y) be the distance from V to the line containing Vi. 
Use a sketch to illustrate the following. 

(a) s(c\) = c s(V) for all numbers c > 0, 

(b) ^(y+w) = ^(y) + ^(W). 


1.8 Linear functions and their representation by matrices 


The notion of linear function is basic. 


1.8 Linear functions and their representation by matrices 


49 


Definition 1.20. A function F from R n to R k is called linear when it has the 
following properties: 

(a) For all numbers c and vectors V in R n , F(cV) = cF(V). 

(b) For all vectors V and W in R n , F(V + W) = F(V) + F(W). 


Combining properties (a) and (b) we deduce that for all vectors Vi,V 2 ,...,V^ 
and numbers c \, C 2 ,..., ct 

F(ci Vi + c 2 V 2 + • • • + CfSi) = ciF(Vi) + c 2 F(V 2 ) + • • • + qF(V^). (1.38) 

We describe now all linear functions from R n to R k . 

We express vector V = (vi,..., v n ) as a linear combination of the standard basis 
vectors Ei,...,E„ 

V = viEi + V2E2 + • • • + v^E^. 

Using equation (1.38) we get 

F(V) = viF(Ei) + v 2 F(E 2 ) + • • • + v w F(E w ). (1.39) 

This shows that the linear function F is determined by the vectors F(Ei),... ,F(E n ). 
Next we use formula (1.39) to characterize all linear functions. 

Choose vectors Mi, M 2 ,..., M„ in R k . We define the function F by 


F(V) = V 1 M 1 + V 2 M 2 h- \~v n M n (1-40) 

for every V = (vi, V2, ..., v n ) in R n . It is easy to verify, and we ask you to show in 
Problem 1.87 that the function F defined by formula (1.40) is linear. According to 
(1.39) all linear functions are of this form. So we have proved the following theorem. 


Theorem 1.18. A function F from R n to R k is linear if and only if there is a 
list of vectors Mi,..., M n in R k so that for every V = (vi,..., v n ) in R n 

F(V) = V 1 M 1 + V 2 M 2 H- 1 - v^Mft. 


Matrix notation. If F is a linear function denote the components of the vectors M* 
as • • •, mki, (i = 1, 2,..., n). We arrange these numbers in a rectangular array: 


mu rn \2 m\ n 


M = 


n% 2 \ m 22 • • • mn 


(1.41) 


L mi mi'" mien 


The y’-th column consists of the components of the vector My. 
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Definition 1.21. A rectangular array of numbers M in (1.41), with k rows and 
n columns is called a k by n (or k x n) matrix. The number in the i- th row and 
7 -th column is denoted as mtj. 


Definition 1.22. The product MV of a k by n matrix M and a column vector 
V whose components are vi,V 2 ,..., v n , is the column vector whose i- th com¬ 
ponent is 

mnvi +mi2V2 + --- + m in v n (i = 1,2,...,*:) 

In words: the i- th component of MV is the dot product of the i- th row of the 
matrix M and the vector V. 


Example 1.25. Here are three examples: 


1 2 

-1 


1 

3 4 

1 


1 


1 2 3 

1 


0 


1 4 



3 

4 5 6 

1 

= 

3 


2 1 

-1 

1 

= 

-1 

7 8 9 

-1 


6 


-3 3 


6 


□ 

Matrices give us an alternative way to represent linear functions. 

Theorem 1.19. Every linearfunction ¥ from R n to R k can be written in matrix 
form 

F(V) = MV 

for some kxn matrix M. 


Linear functions can be multiplied by a number, added and composed, and the 
result is a linear function; we formulate these operations as an “algebra” of linear 
functions. 

(i) For a linear function F from R n to R k and a number c the product cF is defined 
by 

(cF)(V) = cF(V). 


(ii) For a pair of linear functions F and G both mapping R n to R k , their sum F + G 
is defined by 


(F + G)(V) = F(V) + G(V). 


(iii) Denote by F a linear function from R n to R k and by G a linear function from 
R k to R m . Their composition is defined by 


GoF(V) = G(F(V)). 
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It is a function from R n to R m . 

We ask you in Problem 1.90 to verify that constant multiples, sums, and compos¬ 
ites of linear functions are linear. 

Since linear functions are represented by matrices, these operations can be 
expressed as an algebra of matrices: 

(i)’ For a matrix M and number c, cM is defined to be the matrix whose elements 
are c times the elements of M. 


(cM)ij = crriij. 

(ii)’ For a pair of k by n matrices M and N, their sum M + N is defined to be the 
k by n matrix whose elements are the sums of the elements of M and N: 


mij + nij. 

(iii)’ For a k by n matrix M and a m by k matrix N the matrix product NM is an 
m by n matrix whose i y-th element is the sum 


k 

'y ^ klihkklhj- 
h= 1 


One can think of this sum as the dot product of the i -th row of the matrix N and y-th 
column of the matrix M. 

Formulas (i)’ and (ii)’ clearly express rules (i) and (ii) for linear functions. We 
ask you in Problem 1.91 to verify that formula (iii)’ expresses rule (iii). 

Example 1.26. Here are some examples of matrix products: 


1 2 3 

1 -1 


14 0 

45 6 

2 -1 

= 

32 -3 

7 8 9 

3 1 


50 -6 


1 3 

2 1 


-1 10 

4 5 

-1 3 


3 19 


□ 

Matrix multiplication is not commutative; that is, KL and LK are in general not 
equal. 

Example 1.27. Let 


1 2 


3 4 



-1 

4 


then 


KL = 


1 2 

1 -1 


-5 7 


1 -1 

1 2 


<N 

1 

<N 

1 

3 4 

-3 4 

— 

-9 13 

, LK — 

-3 4 

3 4 

— 

9 10 


□ 
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If K and L are matrices, it is possible that one of the two products KL and LK is 
defined and the other not defined. 


We turn now to square matrices; they represent linear functions from R n to R n . 
Denote by l n the n x n matrix whose diagonal elements are 1 and off-diagonal ele¬ 


ments are zero. For n = 3 


13 = 


1 00 
0 1 0 
00 1 


l n is called the nxn identity matrix. Multiplication by l n is the identity function: 


I„V = V 


for every vector V in W 1 . 

The basic result about square matrices is the following. 


Theorem 1.20. Let M be an nxn matrix , and let U, V and W be vectors in 

R n . 

(a) //MV = 0 only for V = 0, then every vector W can be represented as 
W = MU for some U. 

(b) If every vector W can be represented as MU for some U then MV = 0 only 
for V = 0. 


We say that a function F is one to one if F(U) = F(V) only when U = V. Before 
presenting the proof of Theorem 1.20 we draw some of its consequences. In case (a) 
the function F defined by F(V) = MV is one to one. For if 

F(V) = F(U), 

then by linearity F(V - U) = F(V) - F(U) = 0. Since the only vector mapped into 
zero by multiplication by M is the zero vector, it follows that V - U = 0 and so 
V = U. Therefore every vector W can be represented as W = MU = F(U) for some 
U, and U is uniquely determined because F is one to one. That is, F has an inverse 
function that we denote F -1 . 

We show now that F -1 is linear. Let F(U) = W and F(V) = Z. Then by linearity 
F(U + V) = F(U) + F(V) = W + Z. 


By definition of inverse 

U = F“ 1 (W), V = F“ 1 (Z) and U + V = F -1 (W + Z). 

Therefore 

F -1 (W) + F” 1 (Z) = F _1 (W + Z). 


One can verify similarly that 





1.8 Linear functions and their representation by matrices 


53 


F -1 (cW) = cF _1 (W). 

According to Theorem 1.19 the inverse function F -1 can be represented by a 
matrix. We make the following definition. 

Definition 1.23. Let M be a square matrix, and denote the corresponding lin¬ 
ear function from R n to R n as F: 

F(V) = MV. 

If F has an inverse function F -1 then we say M is invertible and denote by 

M -1 

the matrix that represents F _1 . We call M -1 the inverse of the matrix M. 


We present now a proof of Theorem 1.20. 

Proof, (a) Take the case that MV = 0 only for the vector V = 0. We show that the 
n vectors MEi,ME 2 ,...,ME n are linearly independent. For suppose there were a 
nontrivial linear relation 


cqMEj + C2ME2 + • • • + c w ME w — 0 . 

By properties of matrices (i)’-(iii)’ we can write this relation as 
M(ciEi + C2E2 H-1- qE„) = 0 . 

Since MV = 0 only for V = 0, it follows that c\ = C 2 = • ■ ■ = c n - 0. This proves the 
linear independence of the vectors ME y -. According to Theorem 1.9, every vector 
W can be represented uniquely as 

W = fiqMEi + { 22 ME 2 + • • • + # W ME W . 

By linearity we can write this relation as 


W — M!({3qEl + ^2^2 + ' ' ' + Cly^jyi). 


This proves that every vector W can be represented as MU. 

(b) Take the case that every vector can be represented as MU. Then the unit 
vectors E, can be represented as 

E i = MU/. (i = 1,2,.. .,n) 

By linearity, for all numbers c \, C 2 ,..., c n we have 


M!(ciUi + C 2 U 2 + • • • + c n \J n ) — C 1 MU 1 + C 2^^2 + * * * + c^M^U; 
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— CiEi+C 2 E 2 H- \-C n \L n . (1-42) 

The right side is nonzero unless all the numbers q are zero. Therefore so is the left 

side. It follows that M(ciUi + C 2 U 2 H- 1 -c n \J n ) is nonzero unless all numbers q 

are zero. Consequently c\X}\ +C 2 U 2 H- \-c n \J n is nonzero unless all the numbers q 

are zero. This shows that the vectors U/ are linearly independent. By Theorem 1.9, 
every vector W is some linear combination C 1 U 1 + C 2 U 2 + • • • + c n XJ n of the linearly 
independent vectors Ui,... ,U W . It follows from equation (1.42) that MW = 0 only 
if all the numbers q are zero. But then W = 0. This proves part (b). □ 

In Problems 1.92, 1.93, and 1.94 we ask you to justify some additional results 
regarding matrices and determinants: 

(a) For square matrices A and B of the same size, 

det(AB) = det(A)det(B). 

(b) If the n x n matrix A has an inverse matrix A -1 , then 

AA -1 = I„ = A -1 A 
and 

det(A _1 ) = (det(A)) -1 . 


Problems 


1.84. Evaluate the products. 


(a) 

(b) 

(c) 

(d) 


1 00 
020 
006 


xi 

*2 

X 3 


3 -4 

4 3 


6 

6 


cos # - sin # 
sin# cos# 


1 

0 


cos# - sin# 
sin# cos# 


cos# - sin# 
sin# cos# 


cos# - sin# 
sin# cos# 


1.85. Evaluate the products, where 



1 


-3 


0 1 

X = 

, Y = 

1 

, A = 


3 



-1 0 


B = 


b u b\ 2 
bn b22 


(a) AX (b)X-Y (c)Y-(AX) (cl) E,-BE ; , uj= 1,2 

1.86. Evaluate the products, where 
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1 


-3 


0 1 0 


b\\ b\ 2 b\ 2 

x = 

2 

, Y = 

0 

, A = 

1 00 

, B = 

b 2 \ b 22 b 22 


3 


1 


007 


b 2 \ b 22 Z733 


(a) AX (b) X • Y (c) Y • (AX) (d) E* • BE ; , i, j= 1,2,3 

1.87. Show that the function defined in equation (1.40) is linear. 


1.88. Express these formulas as a product AX of a matrix A and a column vector X. 


(a) 

(b) 

(c) 

(d) 


X\ + X 2 
X\ ~X2 


where X is in R 2 


X 2 - 4 ^ 3 , where X is in R 3 
X 2 - 4 x 3 , where X is in R 4 
X 2 - 4x 3 


X\ +X 4 


where X is in R 4 . 


Xi + X 2 + X 3 + X 4 


1.89. Express these formulas as X • (AY), where X and Y are column vectors of R 2 
and A is a matrix. 

(a) xiyi + 2x 2 y 2 

(b) xiy 2 -x 2 yi 

(c) x\ (yi + 3 y 2 ) + x 2 (yi - y 2 ) 


1.90. Let F and G be linear functions from R n to R m , let H be a linear function from 
R m to R*, and let c be a number. Show the following. 

(a) F + G is a linear function. 

(b) cF is a linear function. 

(c) HoF is a linear function. 


1.91. Let N be an m by k matrix, M a k by n matrix, and X in W 1 . Show that the 
product N(MX) can be found by multiplying X by the matrix with i j entry 


k 

^ ^ W'ihWlhj') 
h= 1 


the dot product of row i of N with column j of M. That is, 


N(MX) = (NM)X. 


1.92. Assume A is an n x n matrix with det A ^ 0, and consider the function of n x n 
matrices B defined by 


/(B) = 


det(AB) 
det A 


Show that / satisfies the three basic properties that characterize the determinant. 
Thus by uniqueness /(B) must be equal to detB. Conclude that 


det(AB) = det(A)det(B). 
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1.93. Assume A and B are n x n matrices with det A = 0. Show the following: 

(a) There is a vector W such that for all V, AV + W. (See Theorem 1.20.) 

(b) There is a vector W such that for all U, ABU ^ W. 

(c) det(AB) = 0. 

Therefore det(AB) = det(A)det(B) holds when det A = 0. 

1.94. Prove the following. 

(a) If the nxn matrix A has an inverse, then A A -1 = A -1 A = \ n . 

(b) Use the formula det(AB) = det(A)det(B) to prove that det(A _1 ) = (det A) -1 for 
every invertible matrix A. 


1.9 Geometric applications 

Lines, planes, and hyperplanes. We have noted that for two vectors U and V in 
R 2 , and all numbers c, the points 

U + cV 

lie along a line through U and parallel to V. If U and V are vectors in R", the same 
expression describes a line in R”. See Figure 1.26 for the case n- 3. 



Fig. 1.26 Left: A line in R 3 . Center: A two-dimensional plane in R 3 . Right: A plane specified as 
in Example 1.28. 


Let U, Vi, and V 2 be three vectors in R 3 , Vi and V 2 linearly independent, and let 
s and t be numbers. For each of the points U + sX\ on the line though U parallel to 
V 1 , there is a line U + sX\ + t \2 through U + sY 1 parallel to \ 2 . The set of points 

u+^Vi + ^y 2 

is a plane in R 3 . The same expression represents a two-dimensional plane in any 
number of dimensions greater than three. 
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In the case of R 3 there is an alternate description for a plane, given by specifying 
a normal vector N perpendicular to the plane. 

Example 1.28. Let U = (1,2,3) and N = (2,6, -3), and consider the plane con¬ 
taining U and perpendicular to N. That is the set of points X = (x,y,z) that 
satisfy 

N • (X - U) = 0. 

For U and N specified above this equation can be written as 
2(x - 1) + 6(y - 2) - 3(z - 3) = 0 

or as z = -f + \x + 2 y. See Figure 1.26. Since N • (U - U) = N • 0 = 0, U 
satisfies the equation. If X and Y satisfy the equation, then the vector X - Y is 
orthogonal to N, 

N-(X-Y) = N-(X-U-(Y-U)) = N-(X-U)-N*(Y-U) = 0-0 = 0. 


□ 

When U and Vi, V 2 , • •., V w -i are vectors in R n and the Y j are linearly indepen¬ 
dent, we have called the set of all vectors of the form 


U + t\\\ H-h t n -\\ n -\ 

a hyperplane. A hyperplane in R n can also be defined, given a point U in the hyper¬ 
plane and a nonzero vector N orthogonal to the plane, by the equation 


N- (X-U) = 0. 


The distance from a point V in R n to that hyperplane is ||V - U||cos# where 6 is the 
angle between N and V - U. (See Figure 1.27.) In the case where N is a unit vector 
that is 

distance =N-(V-U), (||N|| = 1). (1.43) 



Fig. 1.27 Distance from point to hyperplane in equation (1.43). 


Cross product in R 3 . For a pair of vectors V and W in R 3 we introduce the cross 
product , denoted as V x W. For all column vectors U, V, and W in R 3 , the array 


58 


1 Vectors and matrices 


[UV W] 

is a 3 x 3 matrix. For given vectors V and W let 

£(U) = det[UVW]. (1.44) 

Since the determinant is a multilinear function of its columns, £(U) is a linear func¬ 
tion of U. According to the representation theorem for linear functions, £(U) is of 
the form 

£(U) = U-Z (1.45) 

Z some vector. Since the linear function £(U) is determined by the vectors V and 
W, so is the vector Z. We call this vector Z the cross product of V and W, 

Z = Vx W. (1.46) 

Combining (1.44), (1.45), and (1.46), we get that for all vectors U, V, and W 

det[U V W] = U • (V x W). (1.47) 

Since the determinant on the left side of (1.47) is a multilinear function of U, V, and 
W, so is the right side. It follows that 

The cross product VxW as defined hy (1.47) is a bilinear function ofY and W. 
We ask you in Problem 1.103 to derive from (1.47) the following formula for the 
cross product. 

Definition 1.24. The cross product of two vectors 

v = (V1,V2,V 3 ), W = (wi,w 2 ,w 3 ) 

in R 3 is defined as 

Vx W = (v 2 W 3 - V 3 W 2 ,-(ViW3 - V 3 Wi),ViW 2 - V 2 Wi). 


Example 1.29. Let i = (1,0,0), j = (0,1,0), and k = (0,0,1). Then 

ixj = k, j xk = i, kxi = j. 


□ 

The formula for V x W in Definition 1.24 can be found by formally computing 
det[U, V, W] where U is the symbolic vector [i, j,k]. 

The cross product has the following properties. 
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Theorem 1.21 .(a) V x W is orthogonal to V and to W, 

V • (V x W) = 0, W • (Vx W) = 0. 

(b) The signed volume of the ordered tetrahedron with vertices 0,U,V,W is 
^U-(VxW). 

(c) W x V = -V x W 


Proof, (a) If we set U = V or U = W in (1.47), the left side is zero, because 
the determinant of a matrix that has two equal columns is zero. Therefore the right 
side is zero too. 

(b) We have shown in Section 1.7 that ^det[U V W] is the signed volume of the 
ordered tetrahedron OUVW. Since det[U V W] = U • (V x W), part (b) follows. 

(c) If we interchange V and W on the left side of (1.47) the sign changes by 

properties of determinant. Part (c) follows from this observation. □ 


We can give now four related uses for cross products and determinant. See 
Figure 1.28. Let U, V, and W be vectors in R 3 . 

• The volume of the parallelopiped determined by U, V, W is |U • (V x W)|. 

• The area of the parallelogram determined by linearly independent vectors V and 

Vx W 

W is ||V x W||. To see this let U be ————. The volume of the parallelopiped 

l|VxW|| 

determined by U, V, W is ||V x W|| and its height is 1. Therefore the area of its 
base is ||V x W||. 

• The area of the triangle with vertices 0, V, and W is \ IIV x W||. 

• Imagine that at each point in space the velocity of a fluid is U = (u\ , U 2 ,u^). Con¬ 
sider a parallelogram determined by vectors V = (vi , V2, V3) and W = (w \, W2, W3) 
as shown in Figure 1.28. Then U • (V x W) is the volume of fluid that flows across 
the parallelogram in one time unit. This is called the volumetric flow rate or flux. 


Define a unit vector orthogonal to the parallelogram as N = 


VxW 

IIVxWH 


; we get 


flux = U • (V x W) = U • N||V x W||, 


Example 1.30. Find a plane containing three noncollinear points U,V, and W. 

(a) We can express the plane as the set of all points 

X = U + s(V - U) + t(W - U) 
as .v and t vary over all real numbers. 

(b) A normal vector to the plane is N = (V - U) x (W - U), so we get an equation 
for the plane, 

N • (X - U) = 0. 


□ 
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Example 1.31. Let U = (1,1,-2) [length/time] be the velocity of a fluid flow. 
Find the volumetric flow rate across the oriented triangles OVW, where V and 
W have length dimensions. 

(a) V = (2,2, -4), W = (3,2,1) 

(b) V = (1,0,0), W = (3,2,1) 

The flow rate through the triangle is half that through the parallelogram, so it 
is • (V x W). In (a), V is parallel to U, and 

jU • (V x W) = 2 det [U 2U W] = 0. 

No fluid flows across the triangle because the fluid velocity is parallel to it. In 
(b) the flow rate is 


^U-(V x W) = ^det 


1 1 3 
1 02 
-2 0 1 


5 

2 * 


The minus sign means that the angle between U and VxW is greater than 
7T/2. □ 

We ask you in Problem 1.106 to explore other properties of the cross product. 


Problems 


1.95. Find equations in the form ax + by + cz = d for the following planes in M 3 . 

(a) The plane through the origin with normal (1,0,0). 

(b) The plane through (0,0,0), (0,1,1), and (-3,0,0). 
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(c) The plane containing the point (1,1,1) and parallel to the plane with equation 
x — 3y + 5z — 60. 

1.96. Which of these points are on the line 2x\ + X 2 = 0 in R 2 ? 

(a) (0,0) (b) (-1,2) (c) cX if X is. (d) X + Y if X and Y are. 

1.97. Which of these points are on the line 10xi + 5 x 2 = 0 in R 2 ? 

(a) (0,0) (b) (-1,2) (c) cX if X is. (d) X + Y if X and Y are. 

1.98. Which of these points are on the line 2x\ +X 2 = 100 in R 2 ? 

(a) (0,0) (b) (50,0) (c) (50,0) + Y if 2 yi +y 2 = 0 (d) cX if X is. 

1.99. Which of these points are on the plane in R 3 given by 20xi + 10 x 2 - 50x3 = 0? 

(a) (0,0,0) 

(b) (0,5,1) 

(c) (-1,2,0) 

(d) cX if X is. 

(e) X + Y if X and Y are. 

1.100. Which of these points are on the plane in R 3 given by 2xi + X 2 - 5 x 3 = 100? 

(a) (0,0,0) 

(b) (50,0,0) 

(c) ( 0 , 100 , 0 ) 

(d) (0,100,0) + Y if 2yi + y 2 - 5y 3 = 0. 

(e) X + Y if X and Y are. 

1.101. Let U = 0, Vi = (0,1,1), and V 2 = (-3,0,0). 

(a) Find an equation of the line through U and Vi in the form X(s) = U + sY\ . 

(b) Find an equation of the line through U and V 2 in the form X(t) = U + t\ 2 . 

(c) Find an equation of the plane through U, Vi, and V 2 in the form 

XCs,0 = U + s-Vi+fV 2 . 

1.102. Which of these points are on the hyperplane in R 4 where x\ + X 2 + X 3 + X 4 = 0? 

(a) ( 5 ,- 5 , 5 ,- 5 ) (b) (a, a, a,-3a) (c) (—1,1) (d)cXifXis. (e)(l,-3,2) 

1.103. Let 


U\ 


Vl 


W\ 

u 2 

v = 

V2 

w = 

w 2 

U 3 


V 3 


w 3 


Write expressions a, b, and c in terms of the components of V and W so that 
det[U V W] = au\ +bu 2 + cu 2 . 

Compare your results to formula (1.47), det[U V W] = U • V x W, to derive a formula 
for V x W. 
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1.104. Evaluate the cross products using the formula 

V X W = (V 2 W 3 - V 3 W 2 ,V 3 Wi - ViW3,ViW2 - V 2 W 1 ). 

(a) ( 1 , 0 , 0 ) x ( 0 , 1 , 0 ) 

(b) ( 0 , 1 , 0 ) x ( 1 , 0 , 0 ) 

(c) ( 0 , 0 , 1 ) x (a,b,c) 

1.105. Using ixj = k, j x k = i, k x i = j, and the distributive and anticommutative 
laws Ux(V + W) = UxV + UxW, VxW = -WxV, evaluate the following cross 
products. 

(a) ( 1 , 0 , 0 ) x ( 0 , 1 , 0 ) 

(b) j x (i + k) 

(c) (2i + 3k) x ( a\ + b\ + ck) 

1.106. Verify the following properties of the cross product in R 3 . 

(a) the product is anticommutative: U x V = -V x U 

(b) the “back-cab” rule: A x (B x C) = B(A • C) - C(A • B), 

(c) the product is not associative: A x (B x C) ^ (A x B) x C. In fact there is a product 
rule similar to the product rule for derivatives: 

A x (B x C) = (A x B) x C + B x (A x C). 

(d) As a function of V, the product U x V is linear, so can be written as U x V = MV 
for some matrix M. Find M. 

(e) ||U x V|| 2 + (U • V) 2 = ||U|| 2 ||V|| 2 . 

(f) Use part (e) and U • V = ||U|| ||V||cos0, where 6 is the angle between the vectors, 
to show that ||U x V|| = ||U|| ||V|| sin0. 

1.107. Find the volumetric flow rate U • (V x W) of a fluid of velocity U through 
the parallelogram determined by V and W. Sketch the parallelogram, and the paral- 
lelopiped determined by U, V, and W. 

(a) U = (2,0,0), V = (0,2,0), W = (0,0,7). 

(b) U = (-2,0,0), V = (0,2,0), W = (0,0,7). 

(c) U = (2,1,0), V = (1,2,0), W = (7,7,7). 


Chapter 2 

Functions 


Abstract The concept of function is central to mathematics. In single variable cal¬ 
culus we studied functions that assign to each number in their domain a number. In 
multivariable calculus we study functions that assign to each vector with n compo¬ 
nents in their domain a vector with m components. 


2.1 Functions of several variables 

We use the notation F : D c R n —> R m for a function F that assigns a vector F(X) 
in R m to each vector X in a subset D of R n , and say that F is a function from R n to 
R m . When the domain D of a function F of n variables is not specified we assume, 
as we have with functions of a single variable, that the domain is the largest set for 
which the definition makes sense. We call the set of outputs F (D) the range of F or 
the image of D. We call F one to one if F(U) = F(V) only when U = V. We say F 
maps D onto a set B in R m if F (D) = B. We usually denote a function whose output 
is a vector by a bold capital letter. A function whose output is a number is called 
a scalar valued, or real valued, function and we usually denote it by a lower case 
letter. 

Iff and F are functions then/(X) and F(X) are values assigned to X. It is some¬ 
times convenient to indicate names of domain and range variables by speaking of “a 
function” r =/( 0 ), U = F(X), V = (u(x,y,t),v(x,y,t)), and so on. 

Definition 2.1. A function F : D c R n — > R m assigns a vector F(X) in R m to 
each X in D, denoted 

F (X) = (fi(X),/ 2 (X),...,/ m (X)). 

The function^- is called the 7 -th component function of F. 


Next we look at some examples of functions. 
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Definition 2.2. A function that assigns the same vector C in R m to each vector 
X in R n is called a constant function. 


Example 2.1. The functions f(x,y) = 1 and G(x,y) = (8,3,2) are examples of 
constant functions. / is a constant function from R 2 to R, and G is a constant 
function from R 2 to R 3 . □ 

Linear functions In Chapter 1 we defined linear functions from R n to R m . We 
review that definition and Theorem 1.19 about representing linear functions. 


Definition 2.3. A function L from R n to R m is linear if 

<zL(U) = L(aU), and L(U + V) = L(U) + L(V) (2.1) 

for all numbers a and all vectors U and V in R n . 


Example 2.2. Is L(x,y) = (2x - 3 y 9 x, 5y) a linear function from R 2 to R 3 ? Let’s 
check to see whether ( 2 . 1 ) holds. 

Let a be a number, and let (x,y) and (m, v) be vectors in R 2 . 

aL(x,y) = a(2x - 3 y,x, 5y) = ( 2 ax - 3ay,ax , 5 ay) = L (ax,ay) = L (a(x,y)). 

Also, 

L (x,y) + L(m, v) = (2x - 3 y,x, 5y) + (2 u - 3v, u , 5v) 

= (2(x + u) - 3 (y + v), v + u , 5(y + v)) = L(x + + v). 

So L is linear. □ 

In Problem 2.4 we ask you to show that a constant function F from R w to R m is 
linear if and only if F(x\,X 2 ,... ,x n ) = 0 . 

Recall that according to Theorem 1.10 every linear function € from R n to R is of 
the form 

t(X\ ,X 2 , ...,Xn) = C\X\ + C 2 X 2 + C 3 V 3 + ... + C n X n , 

which we can write as 

^(X) = C-X, 

for some C = (c \, c 2 ,..., c n ) in R n . 

Let’s look again at why a linear function L from R^ to R m can be represented using 
matrix multiplication. Let Ik be the k -th component function of L, k = 1,2,... ,m. 
Since L is linear it follows that each component 4(X) is a linear function. So by 
Theorem 1.10 there is a vector in R n so that 4(X) = C& • X. Denote the vectors 
Cfc = (cki,Ck 2 , ... Ckn ) as row vectors and denote X and L(X) as column vectors. Let 
C be the matrix whose k -th row is C&. The relations 4(X) = • X, k = 1,2,... ,m 

can be expressed as the product of the matrix C and the vector X: 
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L(X) = CX. 


Theorem 2.1. Every linear function Lfrom R 

n to R m can he written in matrix 

form 

~cn c \2 ■ 

C\n 

~x\ 


L(X) = 

C 21 c 2 2 • 

C2n 

X: 2 

= CX. (2.2) 


Tm\ Cm2 

Cmn- 

An. 


C is called the matrix of L or the matrix that represents L. (See also Theo- 

rem 1.19.) 






We use the following definition to quantify the size of a matrix. 

Definition 2.4. For an mxn matrix C 

= [cij], the norm ||C|| of C is defined by 

IICII = 

m n 

ZZ4 (23) 

i= 1 7=1 


There is an important relation between the norms of X, CX, and C. 

Theorem 2.2. Let C be an mxn matrix. Then 

IICXII < IICIIIIXII 

for every vector X in ML. 


Proof The &-th component of CX is the dot product of the &-th row of C, 

C k ~ (T&l? Ck2'> • • • > Ckn) 

with X = (x\ ,X 2 , . • • ,x n ). By the definition of norm, 

IICXII = A /(C 1 -X) 2 + (C 2 -X) 2 + --- + (C m -X) 2 . 

According to the Cauchy-Schwarz inequality, Theorem 1.12, applied to each com¬ 
ponent this is 

< VllCi|p||X||2 + ||C 2 || 2 I|X||2 + - - - + ||C m || 2 ||X|p 


IIXII VllCiip + ||C 2 || 2 + - • - + Iic m |p = ||X||. 


A ZZ c H C||||X|1 - 

> i= 1 7=1 
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Combining Theorems 2.1 and 2.2 we see that for every linear function L from 

R n to R m 


||LX|| < ||C|| ||X||. 


That is, the norm of the output of a linear function is not greater than ||C|| times the 
norm of the input. 

Functions from R n to R Next we look at some examples of functions from R n to R. 

Example 2.3. The area a of a rectangle that has length x, width y is given by 
a(x,y) = xy. The domain of a is the set D of all ordered pairs (x,y) with x > 0 
and y > 0, a : D c R 2 —> R. Notice that even though the rule for a makes sense 
for all ordered pairs (x,y), we restricted the domain based on the context of 
the problem. □ 

Example 2.4. The volume v of a rectangular box that has length x, width y, 
and height z is given by the function v(x,y,z) = xyz. The domain of v, D , is 
the set of all ordered triples (x,y,z) with x > 0, y > 0, and z > 0. The function 
v : D c R 3 -> R. □ 

Example 2.5. Let g(x,y) = yjx 2 +y 2 , g : R 2 —> R. g(x,y ) is the norm of (x,y), 
g(x,y) = \\(x,y)\\. □ 

Visualizing functions; graphs of functions from R 2 to R. In single variable cal¬ 
culus we visualized a function/ by sketching its graph, the set of all points (x,/(*)). 
For a function from R 2 to R, the graph off is the set of ordered triples (x,y,f(x,y)), 
and often looks like a surface in (x,y,z) space, as in Figure 2.1. 



Next we sketch the graph of a few functions from R 2 to R. 

Example 2.6. Figure 2.2 shows the graph of the constant function f(x,y) = 7. 
□ 

Example 2.7. To sketch the graph of f(x,y) = x 2 +y 2 we notice that the points 
in the domain where f(x,y) = c, c > 0, lie on a circle of radius Vc. The circle 
of radius zero is one point. On the left side of Figure 2.3. we sketched the 
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Fig. 2.2 The graph of the constant function in Example 2.6. 


points in the domain satisfying x 2 + y 2 = c for c = 0,1,4,9. On the right hand 
side of Figure 2.3. we sketched the corresponding points on the graph off. □ 


Definition 2.5. Let/ :DcR w ^Ibea real valued function, and c a number. 
The set of all points (x\,X2,... ,x n ) in the domain D where/ (x\ ,X2, ... ,x n ) = c 
is called the c level set off. 


For functions of two variables we can draw level sets in the domain and plot 
corresponding points (x,y,c) on the graph. This gives an idea what the graph looks 
like, as we did in Example 2.7. The set of points on the graph that correspond to a 
level set f(x,y) -c is called the contour curve of the graph at z-c. 




Fig. 2.3 Left: level sets c - 0, 1,4,9 for f(x,y) = x 2 +y 2 , Right: the graph (cut away for clarity) of 
f(x,y ) with contour curves shown at z— 1,4 and 9. See Example 2.7. 


Example 2.8. Let f(x,y) = ^— ^r, (v,y) + (0,0). Find level set c - ^ 

x 2 +y z x z +y z 

or x 2 -y 2 = c(x 2 +y 2 ) with x 2 +y 2 ^ 0. First we consider some simple choices 

for c. When c = 1 we find that x 2 -y 2 = x 2 +y 2 . Thus the c=l level set is the 

set of all points y = 0, the v axis, with the origin removed. Similarly, the level 

set for c = -1 satisfies x 2 -y 2 = -(x 2 +y 2 ) with x 2 +y 2 ± 0. So this level set 

is the set of points v = 0, the y axis, with the origin removed. For c ± -1 and 

r^Owe can divide by x 2 to get 
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i-4= c (uK 


(y\ z (y\ z 1—c 

Solving for - we get - =-. Thus, each c level set is a pair of lines 

\xJ \xJ 1+c 


1+c 


x and y - 


1+c 


x. So for example, the level set for c = 


- \ is the 


pair of lines y = V3x, y = - V3x, origin deleted. In Figure 2.4 we sketched the 
level sets for c = - \, 0,1 and the graph of / with the corresponding contour 
lines on the graph. □ 



Fig. 2.4 Left: level sets c = -^,0, 1 off(x,y) 
z = 0 shown. See Example 2.8. 


Z 



x - y 

— --. Right: the graph off with contour lines at 

+y z 


Here are some further examples of level sets of a function. 

Example 2.9. Figure 2.5 shows a circular disk, D , and all the points in D where 
the value of a function t(x,y ) is 2, 3, 4, 5, 6, 8, 10 or 10.5. The level sets t- 2 
and t - 10.5 are points. The level sets t - 3, t - 4, t - 6, and t - 8 are curves. 
The level set t - 5 consists of all the points of the shaded band. □ 



Fig. 2.5 Level sets in Example 2.9. 


Example 2.10. Let f(x,y,z) = 


, The domain off is the set of all 


V* 2 +y 2 +z 2 

points (x,y,z) + (0,0,0). The graph off is the set of points (x,y,z,f(x,y,z))- 
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The c level set off for c > 0 is the set of points that satisfy - = c , 

Vx 2 +y 2 + z 2 

or x 2 + y 2 + z 2 = —. It is the sphere with radius - centered at the origin of R 3 . 

c 2 c 

□ 

Functions from R to R n Next we look at examples of functions from R to R n . 

Example 2.11. Let I be an interval [a,b\. Let x, y, and z be functions from I 
to R and let 


P(0 = (x(t),y(t),z(t)). 


Figure 2.6 shows a, t\ , and b on the interval I and the corresponding image 
positions in space P (a), P(0), P (tf), and P (b). □ 



Fig. 2.6 Position P(0 in Example 2.11. 


Example 2.12. Let F (t) = (cost,sint,t), 0 < t < 4n. Figure 2.7 shows solid 
dots as points F (t) where t - 0, |,7t,27t, and 47r, and a curve for the 

other points in the range of F. The resulting curve is called a helix. □ 


0 




Fig. 2.7 The helix in Example 2.12 and its projection (dotted) on the plane. 


Example 2.13. Let F (t) = t( 2,3,4) = (2^3^40- Figure 2.8 shows points in the 
range where t = -. 5,-.25,0,1.1. The range of F is the line that goes through 
the origin and (2,3,4). □ 


Example 2.14. Let G(0 = (1,2,0) + 1(2, 3,4) = (1 -\-2t,2-\-3t,4t). The range of 
G is the line in Figure 2.9 that goes through (1,2,0) and (3,5,4). □ 
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Fig. 2.8 The line in Example 2.13. 



Fig. 2.9 The line in Example 2.14. 


In Examples 2.11-2.14 we sketched the range of the functions. In Examples 2.11 
and 2.12 we sketched the domain of each function showing how a point gets mapped 
to a point in R n . 

Functions from R n to R n Next we consider functions from R n to W\ called vec¬ 
tor fields. One way to visualize a vector field from R 2 to R 2 is to draw an arrow 
representing F(x,y) starting at the point (x,y). Let’s look at an example. 

Example 2.15. Describe the vector field F(x,y) = (-y,x) by sketching a few 
vectors. We first make a list. 


(x,y) 

(1,0) 

(0,1) 

(-1,0) 

(0,-1) 

(2,2) 

(-2,2) 

(-2,-2) 

(2,-2) 

F(x,y) 

(0,1) 

(-1,0) 

(0,-1) 

(1,0) 

(-2,2) 

(-2,-2) 

(2,-2) 

(2,2) 


See Figure 2.10. The vectors (x,y) and F(v,y) are perpendicular since the 
dot product (x,y) • (-y,x) = 0. In fact if we take (x,y) on a circle centered at 
(0,0) we see that F(v,y) has magnitude equal to the radius of the circle, and 
direction tangent to the circle. □ 

Example 2.16. Let 


Vx 2 +y 2 +z 2 V* 2 +)' 2+ 2 2 lx 2 +y 2 +z 2 ) 


F (x,y,z) = 
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Fig. 2.10 A sketch of F(x,y) = (-y,x) in Example 2.15. 


Let X = (x,y,z), then ||X|| = yj'x 2 +y 2 +z 2 and we can express F as 


F(X) = 


X 

iixir 


The domain of F contains every point in E 3 except the origin. Every value 
of F is a unit vector. The range of F is the unit sphere centered at the origin. 
Figure 2.11 shows a sketch of this vector field. □ 




Fig. 2.11 The vector field in Example 2.16. Left : a sketch of two points and the vectors assigned 
by the vector field F. Right: a display of 45 points in the first octant and the vectors assigned by F. 


Example 2. 1 7. Let F be the vector field 


F ix.y.z) = - 


1 

(-\/x 2 +y 2 +z 2 ) 3 


(■ x,y,z). 


that can also be expressed as 


XXI 


F(X) = - 
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To visualize F, plot a point X and then sketch the unit vector — whose 

^ I l-X| | 

direction is opposite X. Adjust the length by the factor 7 ^ 7 ^. We call F the 


inverse square vector field. See Figure 2.12. 


I|X || 2 




Fig. 2.12 The inverse square vector field in Example 2.17. Left : a sketch of a single point (x,y,z) 
and the vector F(x,y,z) assigned to it by F. Right: a display of 45 points in the first octant and the 
vectors assigned to them. 


For a function F from R 3 to R 3 physicists employ the notation 

F (x,y,z) = (f\(x,y,z),f2(x,y,z),Mx,y,z)) =fi(x,y,z)i+f 2 (x,y,z)i+Mx,y,z)k 
where 

i = ( 1 , 0 , 0 ), j = ( 0 , 1 , 0 ), k = ( 0 , 0 , 1 ). 

Similarly, a function from R 2 to R 2 can be written using i = (1,0), j = (0,1) as 

F (x,y) = (fi(x,y),f 2 (x,y)) =fi(x,y)i+f 2 (x,y)i. 

Example 2.18. Let F(v,y,z) = (2,3,4) = 2i + 3j + 4k. F is a constant vector 
field. A sketch of F looks like a field of arrows of the same length all pointing 
in the same direction. See Figure 2.13. □ 



Fig. 2.13 The constant vector field F(x,y,z) = (2,3,4) = 2i + 3j + 4k of Example 2.18. 
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Definition 2.6. Let F be a function from R m to R” and let G be a function 
from R^ to R m . Suppose the range of G is contained in the domain of F. We 
define the composite function F o G from R^ to R n by 

F o G(X) = F(G(X)). 


Example 2.19. Let G(x,y,z) = (x 2 ,y 2 + z 2 ) and let f(u,v) = uv. Then 
f oG(x,y,z) = x 2 (y 2 + z 2 )- 


□ 


Problems 

2.1. Express each linear function as/(X) = C • X for some vector C or F(X) = CX 
for some matrix C. 

(a) /(*i,* 2 ) =x\ +2x 2 

(b) /(*i,*2,* 3 ) = x\ + 2 x 2 

(c) F(xi,x 2 ) = (x\,xi +2x 2 ) 

(d) F(xi,X2,X3) = (X\,X\ + 2x 2 ) 

(e) F(xi,X 2 ,X3,X4) = (X4 -X2,X3 ~X\,X 2 + 5 x\,X\ +X 2 +V3 +X4) 

(f) F(xi,x 2 ) = (xi,5xi,-x 2 ,-2xi,x 2 ) 

(g) F(*i,*2,*3,*4,* 5 ) = (xi,5xi,-x 2 ,-2xi,x 2 ) 

2.2. Let f(x,y) be equal to 1 when (x,y) is inside the unit disk centered at the origin, 
and 0 when x 2 +y 2 > 1. Describe the level sets of f. Sketch a graph of f. 

2.3. Let/(X) be equal to ||X || 2 when X is inside the unit ball ||X|| < 1, and 0 when 
l|X|| > 1. Describe the level sets of/ when the domain is 

(a) R, 

(b) R 3 , 

(c) R 5 . 

2.4. Show that a constant function F : R n —> R m is linear if and only if 

F(x\,x 2 ,...,x n ) = 0. 

2.5. A plane in R 3 with equation z = 2x + 3y is the 0 level set of a linear function € 
from R 3 to R. Find l. 

2.6. The function 

f(x,y) = 5+v + v 2 +y 2 , 
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is the sum of the constant function 5, the linear function x, and the degree 2 polyno¬ 
mial x 2 + y 2 . 

(a) Give two examples of points (. x,y ) near (0,0) for which the approximation by the 
first two parts, 

f(x,y)*5+x 

differs from/(x,y) less than 

(b) Show that 

g(u,v)=f(l+u,2 + v) 

is the sum of a constant function, a linear function of (m, v), and a degree 2 poly¬ 
nomial function of u and v. 

2.7. Let/(X) = ||X|| 2 , X in R 4 . Let A be in R 4 and define 

g(X) = ||A || 2 + 2A-(X-A). 

(a) Let U = X - A. Use the formula 

l|A + U || 2 = ||A || 2 + 2A • U + ||U || 2 

to show that the difference between/(X) and g(X) is ||U|| 2 . 

(b) Show that the difference between /(X) and g(X) does not exceed 10 -4 when 
||X — A|| < 10 -2 . 

2.8. Let L be a linear function from R^ to R m . Show that 

(a) If L(X) = 0 and L(Y) = 0, then L(X + Y) is also 0. 

(b) If L(X) = 0 and c is a number, then L(cX) is also 0. 

2.9. Rework Example 2.8 by assuming y ± 0 rather than x^0. 

2.10. Sketch or describe the c level sets,/(v,y) = c in R 2 , for the following functions 
/ and values c. Use the level sets to help sketch the graph of the function. 

(a) /(x,y) = v + 2 y, c = - 1 , 0 , 1,2 

(b ) f(x,y)=xy, c =-1,0, 1,2 

(c ) f(x,y)=x 2 -y, c = - 1 , 0 , 1,2 

(d ) f(x,y) = Vl-^ 2 -J 2 > c = 0 , i, 1 

2.11. Sketch or describe the c level sets,/(v,y) = c in R 2 , for the following functions 
/ from R 2 to R and values c. 

(a ) f(x,y)=x 2 +y 2 , c = 0 , 1,2 

(b ) f(x,y) = ^x 2 +y 2 , c = 0 , 1 , 2 

(c ) f(x,y) = — --, c- 0, 1,2. Which one is empty? 

2.12. Justify the claim that Theorem 1.20 can be restated as follows: “A linear func¬ 
tion from R n to R n is onto if and only if it is one to one.” 
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2.13. Sketch or describe the c level sets,/(X) = c in R 3 , for the following functions 
/ from K 3 to 1R and values c. 

(a) /(X) = ||X|| 2 , c=\, 1,2 

(b) /(X) = ||X||, c= j,l,2 

(c) /(X) = - 2 — c = i 1,2 

J ||X|| 2 2 

(d) /(X) = U-X, c= j, 1, 2 where U is a unit vector. Hint: write X as a vector parallel 
to U plus a vector orthogonal to U. 


2.14. Let C = 


3 1 
2 4 


, and X = 


1 

2 


. verify the inequality 


||CX||<||C||||X||. 


2.15. Show that for a linear function L from R n to R m , the value L(X) is a linear 
combination of the columns \j of the matrix of L: 

L(X) = x\Y\ h- \-x n Y n . 


2.16. Suppose C is an n by n matrix with orthonormal columns. Use Theorem 2.2 
to show that for every X in R n , 


IICXII < V^IIXII. 

Use the Pythagorean theorem and the result of Problem 2.15 to show that in fact for 
every X in R n , 

IICXII = ||X|| 

for such a matrix C. 

X X 

2.17. For every X ± 0 in R n let F(X) = —- and let G(X) =-- and denote their 

11-2^11 ll^ll 

norms by/(X) = ||F(X)|| and g(X) = ||G(X)||. Describe the level sets 
/(X) = 1, g(X) = 1, g(X) = 2 , and g(X) = 4 
in R n . Are there any points in R n where/(X) = 2? 

2.18. Consider a function F (t) = (1 - 1 )A + where A and B are in R 2 . 

(a) Express F (t) as the sum of A and a multiple of B - A. 

(b) For what value of t is F(t) = A? B? the midpoint ^(A + B)? 

(c) For what interval of t are the points F (t) on the line segment from A to B? 

2.19. Consider a function given by F (t,6) = (x(t,0),y(t,0)) from R 2 to R 2 such that 
for each fixed 6 , as t varies from 0 to 1 , F (t,6) runs along the radius of the unit 
circle centered at the origin from ( 0 , 0 ) to (cos 6, sin#). 

(a) Write a rule for F (t,Q). 
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Fig. 2.14 The plane and sphere in Problem 2.20. 


(b) What is the image of the rectangle 0<f<l, |0|<1? 


2.20. Figure 2.14 shows the unit sphere centered at the origin 

x 2 +y 2 +z 2 = 1 

in R 3 , with the x,y plane viewed edgewise. In this problem we introduce a corre¬ 
spondence between the x,y plane and the sphere with the North Pole (0,0,1) deleted, 
known as stereo graphic projection. 

(a) The line segment from the North Pole to (v,y,0) in the x,y plane can be 
parametrized as 

(1 - 0 ( 0 , 0 ,1) + t(x,y, 0), 

with 0 < t < 1. Find t in terms of x and y, so that this point is on the sphere. 

(b) Conclude that the function 

S(x,y) = - \ -y C 2 x,2y,x 2 +y 2 -l) 

1 + x z +y z 

maps the plane onto the sphere, missing the North Pole. 

(c) Which points of the x,y plane correspond to the upper hemisphere? the lower? 

(d) Which points of the x,y plane correspond to the equator? the South Pole? 

(e) Show that the function S -1 given by 

S - 1 C?i,s 2 ,.? 3 ) = -r~~ ( s i> s 2 )> 

1 -S 3 

defined for points (^i, ^ 2 , ^ 3 ) of the sphere other than the North Pole, is the inverse 
function of S. 


2.21. Show that the linear function 


L (x,y,z) = (x,z,-y) 
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maps the unit sphere centered at the origin into itself. If S is the stereographic pro¬ 
jection defined in Problem 2.20, conclude that the composition 

S _1 oLoS 


maps the x,y plane to itself. See Figure 2.15. Where does the right half-plane (x > 0) 
go? 


2 . 22 . 

some 


The gravity force on a particle at point X in R 3 , due to a mass at the origin, is 
negative multiple of 


G(X) = 


X 

iixjp’ 


For points X = A + U near a given nonzero point A, we compare two approximations 
of G given by 


Gi(X) = 2L = G(A)+ 2L = G(A) + Li(U) 

IIAIP ||A||3 

and 

G 2 (X) = G(A) +1 -F--3 A ’ UA ) = G(A) + L 2 (U). 

I l|A|| 3 ||A|p ) 

(a) Show that L] and L 2 are linear functions of U. 

(b) Take A = (1,0,0) and U = (-^,0,0). Show that the relative errors in these approx¬ 
imations are about 

||G(X)-Gi(X)|| ||G(X)-G 2 (X)|| 

l|G(X)|| ’ l|G(X)|| ‘ 

(c) Take A = (1,0,0) and U = (^,0,0). Show that the relative errors are about 


||G(X)-G 1 (X)|| 


l|G(X) — G 2 (X)|| 


. 0003 . 


I|G(X)|| 


l|G(X)|| 
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2.23. In this problem we use single variable calculus to derive the linear 
approximations for 

X _ A + U 

“(||A|| 2 + 2A-U + ||U|| 2 ) 3 / 2 

given in Problem 2.22. 

(a) Suppose a and u are positive numbers. Use Taylor’s Theorem to show there are 
numbers 6 \ and 62 between zero and u with 

(a 2 + w) _3/2 = a ~ 3 - ^(a 2 + 6 \)~ 5 / 2 u, and 

= a~ 3 -\(a 2 + u)~ 5/2 u + y( a 2 + 62 )~ 1 I 2 u 2 . 

(b) Conclude that 

G(A + U) = (||A||- 3 - |(||A|| 2 + 0i)“ 5 /2 (2A • U + ||U|| 2 ))(A + U), and 
= (||A||- 3 - |(||A|| 2 )- 5/2 (2A - U + HUH 2 ) 

+ f (IIAII 2 + 0 2 r 7/2 (2A • U + ||U|| 2 ) 2 )(A + U). 

(c) Sort those terms into the form 

G(A + U) = G(A) + Li (U) + (large) = G(A) + L 2 (U) + (small) 
where “large” is of order ||U|| and “small” is of order ||U|| 2 . 


2.2 Continuity 

In single variable calculus we motivated the definition of continuity of/ at v by ask¬ 
ing whether approximate knowledge of x is sufficient to give approximate knowl¬ 
edge of fix). This is a very practical question, because we almost always round off 
or approximate the input of a function. We said / is continuous at v means that for 
every tolerance e > 0 for error in the output, we can find a level of precision S > 0 
for the input, so that 


if \x-y\ <5 then \f(x)-f(y)\<e. 


This is also what continuity means for a function F : D c W 1 —> R m . 
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Definition 2.7. A function F : D c R” —> R m is continuous at X in D if for 
every tolerance e > 0, there exists a precision S > 0, that depends on 6 , so that 

if ||X — Y|| < S then ||F(X)-F(Y)|| < e. (YinD) 


Let L : R m —> W 1 be a linear function. We show that L is continuous at every X. 
We need to show that for every tolerance e > 0, there is a precision 6 > 0 so that if 
||Y — X|| < S then ||L(Y) - L(X)|| < 6. Denote Y as X + H. Then 

l|L(Y)-L(X)|| = ||L(Y-X)|| = ||L(H)||. 

By Theorem 2.1 there is a matrix C so that L(H) = CH, and by Theorem 2.2 

l|L(H|| = ||CH|| < ||C||||H||. 

If ||C|| is zero then L is the constant function 0. Constant functions F are continuous 
at each X because ||F(X) — F(Y)|| = 0. So we assume ||C|| ^ 0. Given a tolerance 
6 > 0, take ||H|| < 5 = We get 

l|L(Y) — L(X)|| = ||L(H)|| < ||C||||H|| < ||C||-^- = e. 


Therefore L is continuous at X. 


Definition 2.8. A function F : D c W 1 —» R m is continuous on D if F is con¬ 
tinuous at every X in D. 


Definition 2.9. A sequence of points Xi, X 2 ,..., X&,... in R n converges to X if 
for every e > 0 there is a whole number N so that if k > N, then ||X^ - X|| < e. 


As with functions from R to R, a continuous function takes a convergent sequence 
of points in the domain to a convergent sequence of points in the range. 

Theorem 2.3. IfF : D c R n —> R m is continuous on D, then for every sequence 

x 1? x 2 , ...,x*,... 

of points in D that converges to a point X in D, the sequence 
F(X0, F(X 2 ), .F(Xfc), ... 

converges to F(X). 
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Proof. Take e > 0. Since F is continuous at X, there is S > 0 so that 
if ||X — Y|| < S, then ||F(X)-F(Y)|| < e. 

Since the X^ converge to X, given this S there is a whole number N so that 
if k > N, then ||X-X*||<& 


Therefore 


If k>N, then ||F(X)-F(X*)|| < e. 


This proves that the sequence F(X*) converges to F(X). 


□ 


In Problem 2.44 we ask you to prove the converse of Theorem 2.3. 

The next theorem gives us a tool to reduce the question of continuity of F to 
checking the continuity of the component functions. 


Theorem 2.4. A function F : D c R n —> R m , denoted 
F(X) = (/i(X),/ 2 (X),... ,/ m (X)), 

is continuous on D if and only if each component function fj : D cR” —> R is 
continuous on D. 


Proof Suppose F is continuous at X. For every tolerance e > 0 there is a precision 
6 > 0 so that if ||Y — X|| < 6 then ||F(Y) - F(X)|| < e. Let f be one of the component 
functions. Each component of a vector has absolute value less than or equal to the 
norm of the vector. Therefore 

fi(Y)-fi(X)\ < ||F(Y)-F(X)|| < 6. 

So continuity of F at X implies continuity of each component function f at X. 

Now suppose each component functionis continuous at X. We show that F is 
continuous at X. For every tolerance e > 0, there are m precisions S 1 ,^ 2 ,... ,d m , one 
for each component function, so that if ||Y — X|| < Si then f(Y) -f(X)\ < e. 

Take S to be the smallest of the Si. If ||Y - X|| < S then |/}(Y) -f(X)\ < e for all 
i = 1 , 2 ,...,m, and 

l|F(Y) - F(X)|| = V(/i(Y) -/i(X )) 2 + (f 2 (Y) -/ 2 (X )) 2 + • • • + (f m (Y) -/ m (X )) 2 
< V^ 2 + H-1- 6 2 = sfrne. 


Since e can be chosen as small as we like, this shows that ||F(Y) — F(X)|| can be 
made as small as we like by taking S small enough; this shows that F is continuous 
at X. □ 
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Taking n- 1 in Theorem 2.4 shows that F (t) = ... ,/ m (0) is continuous 

on an interval if and only if each component function f is continuous there. 

Example 2.20. F (t) = (cos t , sinL t) is continuous at t because each component 
function is continuous at t. □ 

The next two theorems help us find continuous functions from R n to R. 


Theorem 2.5. Iff \D c R n —> R and g :D are continuous on D then 

(a) f + g is continuous on D, 

(b) fg is continuous on D, 

(c) ^ is continuous at every point X where g(X) ^ 0. 


Proof, (a) Let X be in D and let e > 0 be a tolerance. There are two precisions, Sf 
and 6 g , so that if ||X - Y|| < 6 f then f(X) -/(Y)| < 6 , and if ||X - Y|| < S g then 
|g(X)-g(Y)|<6. 

Let S be the smaller of Sf and S g . Now if ||X - Y|| < S then f (X) -/( Y| < 6 , and 
\g(X) -g(Y)| < 6 . According to the triangle inequality 

1/(X) -/(Y) + g(X) - g(Y)\ < f (X) -/(Y)| + |g(X) - g(Y)\ < 26. 

Regrouping terms gives |(f + g)(X) - (f + g)(Y)| < 26. Since 26 can be made as 
small as we like, we are done. 

(b) Let 6 be a tolerance and let S be as in the proof of part (a). By algebra we have 
\f (X)g(X) -f(Y)g(Y)\ = f (X)g(X) -f(X)g(Y) +/(X)g(Y) -/(Y)g(Y)|. 

By the triangle inequality and properties of absolute values, 

V (X)g(X) -/(Y)g(Y)| < f (X)| |g(X) - g(Y)\ + |g(Y)| f (X) -/(Y)|. 

If ||X - Y|| < S we have by continuity off and g that 

V (X) -/ (Y)| < 6 and |g(X) -g(Y)| < e. 


Therefore 

\f (X)g(X) -/(Y)g(Y)| < e(f(X) | + |g(Y)|). 
We also know by the triangle inequality (see Problem 1.67) that 

||£(X)Hs(Y)||<|g(X)-g(Y)|<e 
so |g(Y)| < |g(X)| + e. Therefore 


\f(X)g(X)-f (Y)g(Y)| < eQf (X)| + |*(X)| + e). 
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For a given X,/(X) and g(X) are fixed numbers. Therefore the expression on the 
right side can be made as small as we like by taking e small enough. 

(c) Let g(X) = k ± 0. Since g is continuous at X there is a y > 0 so that for all Y 
with IIX-YII < r, |g(Y)-g(X)| < \\k\. Since g(X) = k, |g(Y)| > | \k\. Now let 
6 > 0. Since g is continuous at X there is a 8 > 0 so that if Y is within 8 of X, 
| g(X) - g(Y)| < 6 . For ||X — Y|| less than the smaller of y and 8, 


1 1 


g(X) - g(Y) 

g(X) g(Y) 


g(X)g(Y) 


|g(X)-g(Y)| ^ g 
lg(X)||g(Y)| 


Since k is fixed and e can be taken as small as we like, we have that - is contin- 

8 


uous at X. 


□ 


Example 2.21. Consider the function from R 3 to R, 


f(x,y,z) 


x 2 +xy-2x + z + 7 
* 2 -y 3 


The numerator is continuous at every (x,y,z) because it is a sum of functions 
that are constants, or linear, or are products of constants and linear functions. 
The denominator is a continuous function for the same reason. According to 
Theorem 2.5 then/ is continuous at every point (x,y,z) where the denominator 
x 2 - y 3 + 0 and z is arbitrary. □ 

Theorem 2.6. If F : D c R n —> R m is continuous on D, g : A c R m —> R is 
continuous on A, and the range of F is contained in A, then the composition 
g o F : D —> R is continuous on D. 


Proof. Let X be in D. Since g is continuous at F(X), for every e > 0 there is a 8 > 0 
so that if ||F(X) — Y|| < 8 , then |g(F(X)) -g(Y)| < e. Since F is continuous at X, we 
can find a precision y > 0 so that if ||X — Z\\ < y, ||F(X) - F(Z)|| < 8. It follows that 
for HX-ZII <y, |g(F(X))-g(F(Z))| < e. □ 


Theorem 2.7. If F : D cR"^ R m is continuous on D, G : A c R m —> R^ is 
continuous on A and the range of F is contained in A, then the composition 
G o F : D c R n —> R k is continuous on D. 


Proof. Denote G(yi, ... ,y m ) = (gi(yi ,... ,y m ), ...,g k (yu... ,y m )). By Theorem 2.4, g t 
is continuous on A, and by Theorem 2.6, gi o F is continuous on D. By Theorem 2.4, 
a vector function whose components are continuous is continuous. Therefore G o F 
is continuous on D. □ 
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Example 2.22. Let F be the function 


F(X) = - 


X 


Rewriting F in coordinate notation, 


X^O. 


F (x,y,z)- 


(x 2 + y 2 + z 2 ) 3 / 2 ’ (x 2 + y 2 + z 2 ) 3 / 2 ’ (x 2 + y 2 + z 2 ) 3 ^ 2 


According to Theorem 2.6 the function (x 2 + y 2 + z 2 ) 3 ^ 2 is continuous. Then 
according to Theorem 2.5 the component 


/lCW) (. X 2+y2 +z 2 ) 3/2 

is a continuous function at every (.x,y,z) in R 3 except the origin, so/i is con¬ 
tinuous on R 3 - {(0,0,0)}. Similarly the components fa and/3 are continuous 
on R 3 - {(0,0,0)}. So by Theorem 2.4, F is continuous on R 3 - {(0,0,0)}. □ 

Example 2.23. Consider 

F(x,;y,z) = (sin(x + y), e xz+y2 , log(xz)). 

2 

The component functions f\(x,y,z) = sin(v + y) and/2(x,y,z) = Q xz+y are con¬ 
tinuous at every (x,y,z), and the component f^(x,y,z) = log(xz) is continuous 
at each point (. x,y,z ) where vz > 0. By Theorem 2.4, F is continuous at each 
point (x,y,z) where vz > 0. Alternatively, let 

H(v,y,z) = (x + y,xz+y 2 ,xz ), G (u,v,w) = (sinw,e v ,logw). 


Then G o H = F. By Theorem 2.7, F is continuous at all points (v,y,z) where 
xz > 0. □ 


Definition 2.10. A curve is the range of a continuous function XTcR^R" 
from an interval I to R n , 

X(t) = (x\(t),X 2 (t),...,x n (t)), t in I. 

The function X is called a parametrization of the curve. If I is a closed interval 
[a,b\ then X(a) and X(b) are called the endpoints of the curve. If X(a) = X(b) 
we say the curve is closed and call the curve a loop. 


Definition 2.11. A subset A of R n is connected if for all points P and Q in A 
there is a curve in A with endpoints P and Q. 
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Example 2.24. X(t) = (2 cos t, 3 sin t, t) for nn [0,4 n] is sketched in Figure 2.16, 
part of an elliptical helix. □ 



Fig. 2.16 The curve in Example 2.24. 


Example 2.25. The curve given by the function 

X(t) = (a\ +c\t,a 2 + C 2 t,a? ) + c 3 t ) 
is a straight line in R 3 . See Figure 2.17. Let 


A = (aua 2 ,a 3 ), C = (ci,c 2 ,c 3 ) 

and rewrite the function as X(t) = A+ tC. Since X(0) = A, the line goes 
through point A. □ 



Fig. 2.17 The curve X(t) = A + £C in Example 2.25. 


We close this section with some useful notions about the geometry of R n and 
important theorems about continuous functions. 


Definition 2.12. An open ball of radius r > 0 in R n centered at A is the set of 
all X in W 1 with 

IIX — A|| < r. 
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Example 2.26. The open ball of radius 2 centered at (4,5,6) is shown in 
Figure 2.18. It includes all the points inside the sphere surface, but not the 
points on the surface, as we indicate using dotted lines. □ 



Fig. 2.18 A sketch of the open ball in Example 2.26. 


Definition 2.13. A point A in D c R n is called an interior point of D if there 
is an open ball centered at A that is contained in D. The interior of D is the 
set of interior points of D. 


Example 2.27. Take S to be the square region in R 2 consisting of all points 
(x,y) where 0 < v < 1 and 0 < y < 1. See Figure 2.19. We show that every 
point of S is an interior point of S. Let r be the smallest of 

x, y, 1 x, l-y. 

Then the open disk of radius r centered at (x,y) is contained in S, so (x,y) is 
an interior point. □ 


y 

i 


o i x 

Fig. 2.19 The region S in Example 2.27 shown in gray. The dotted points are not included in S. 


Example 2.28. Take D to be the unit disk x 2 +y 2 < 1 centered at the origin in 
the x,y plane. We show that every point P whose distance from the origin is 
less than 1 is an interior point of D: Let r = 1 - ||P|| and let Q be a point in the 
open disk of radius r centered at P. By the triangle inequality 
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IIQII = IIQ - P + P|| < IIQ - P|| + IIPII 
<r + ||P|| = l. 

The points that are within r of P are all in D. So P is an interior point. See 
Figure 2.20. □ 



Fig. 2.20 P is an interior point in Example 2.28. 


Definition 2.14. A set D in R n is said to be open if every point in D is an 
interior point. 


Definition 2.15. A point B of a set D in R n is a boundary point of D if every 
ball centered at B contains points that are in D and also points that are not in 
D. The boundary of D is the set of boundary points of D , denoted dD. 


It follows from this definition that a set and its complement have the same bound¬ 
ary points. 

We denote the complement of D as R n - D, and more generally write A - B for 
the set of points in a set A that are not in B. 


Definition 2.16. A set D is called closed if it contains all its boundary points. 
The closure of a set D is the union of the set D and its boundary points. The 
closure of D is denoted as D. 


An open set B that contains the closure C of a set C is called a neighborhood 
of C. 


Example 2.29. The set S in Example 2.27, where 0<x<l, 0<y<lisan 
open set since every point of S is an interior point. S is a neighborhood of the 
rectangle R defined by .2 < x < .4 and .1 < y < .5. See Figure 2.21. □ 
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Fig. 2.21 Left: The open set in Example 2.29 is a neighborhood of rectangle R. Right: The closed 
set in Example 2.30. The boundary points are drawn solid. 


Example 2.30. Take S to be the square region 

0 <x < 1 , 0 < y < 1 , 

shown in Figure 2.21. Every point with x = 0or.x=lory = 0ory = lisa 
boundary point. Because S contains all its boundary points, S is a closed set. 
□ 


Theorem 2.8. The closure D of a set D is a closed set. 


Proof. We claim that a boundary point B of D is a boundary point of D. To see 
this take any ball 27 centered at B. The ball 27 contains a point that is not in D and 
therefore not in D. We must show that 27 contains a point of D. We know that 27 
contains a point A that is in D. Take a ball centered at A of radius so small that the 
ball is contained in 27. Since A belongs to D the small ball is either contained in D 
or contains points of both D and the complement of D. This shows that 27 contains 
points of both D and the complement D. Therefore B is a boundary point of D. □ 

The following result is basic in the geometry of R n . 

Theorem 2.9. The complement of an open set is a closed set and conversely. 


Proof. It follows from the definition that an open set D contains none of its boundary 
points. So all the boundary points of D belong the complement of D. Since a set and 
its complement have the same boundary points, this proves the theorem. □ 


Definition 2.17. A set D in R n is bounded if there is a number b so that 

l|X||<* 


for every X in D. 
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Example 2.31. The set U of points in R 2 that satisfy 

x 2 +y 2 < 1 

is bounded. It is called the open unit disk. □ 



Fig. 2.22 The cylinder surface, boundary of the solid cylinders in Example 2.32. 


Example 2.32. The set S of points in R 3 that satisfy 

x 2 +y 2 < 1 

is the solid circular cylinder of radius 1 centered about the z axis. We ask you 
in Problem 2.35 to show that S is an open set. Let T be the set of points in R 3 
that satisfy 

x 2 +y 2 < 1. 

The boundary of S and of T is the cylindrical surface of radius 1 centered on 
the z axis. See Figure 2.22. T is closed. Neither S nor T is bounded because 
the z coordinate of points can be arbitrarily large positive or negative. □ 

Theorem 2.10. If a sequence of points Xi,... ,X^,... of a closed set C in R n 
converges to a point X, then X is in C. 


Proof. Suppose X is not in C. Then X is in the complement of C, which is open. So 
there is an open ball of some radius r centered at X, that contains no point of C. But 
the points X^ are in C and for k large enough, 

HX*-X||<r. 

This is a contradiction. Therefore X is in C. □ 


The following result is basic. 
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Theorem 2.11. Extreme Value Theorem A continuous function 

f:C cR^R 

defined on a closed and bounded set C attains its maximum value and its 
minimum at some points in C. 


Proof We show first that a continuous function / defined on a closed, bounded 
subset C of R n is bounded , that is, there is a number b so that | f (X)| < b for all X in 
C. We argue indirectly and suppose on the contrary that/ is unbounded. That means 
that for every whole number k there is a point in C such that 

V (Xfc)| > k. (2.4) 

Divide the space R n into ^-dimensional cubes of unit edge. Since C is bounded, it 
is contained in a finite number of these cubes. Therefore there is a unit cube C\ that 
contains infinitely many of these points X^. 

Divide Ci into ^-dimensional cubes of edge Since the number of these cubes 
is finite, one of them, call it C2, will contain infinitely many of the points X&. 

Continuing in this fashion we obtain a sequence of cubes Q of edge length 2 ~ k , 
each contained in the previous one, each containing infinitely many of the points 
X,. 

We choose now a sequence of points Y m as follows: Y m is one of the points X^ 
contained in the cube C m that is not one of the points Y*, k <m previously chosen. 
Since we have infinitely many points to choose from, such a choice is possible. 
The nested sequence of cubes C n have exactly one point in common; call it Y. The 
sequence of points Y m converges to Y. Since the points Y m belong to C, and since 
C is closed, it follows that Y belongs to C. Since/ is a continuous function, 

lim / (Y m ) =f (Y). (2.5) 

m ^ co 

The sequence Y m is a subsequence of X n . Since according to (2.4) the sequence 
f(X n )| tends to infinity, it follows that also /(Y m )| tends to infinity. This contradicts 
equation (2.5). 

We have arrived at this contradiction by assuming in (2.4) that the function / 
is unbounded. Since this assumption led to a contradiction, we conclude that / is 
bounded on C. That is, the values off are a bounded set. Therefore, by the Least 
Upper Bound Theorem referenced in the Preface, there is a least upper bound. 

Let M be the least upper bound of the values off on C. Then for each whole 
number k > 0 the number M - \ is not an upper bound for the values off. Therefore 
there is a point where 

M>f(Z k )>M-\. 


lim / (Z m ) = M. 

m^oo 


This shows that 


( 2 . 6 ) 
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Arguing as above we can show that this sequence has a subsequence that converges 
to a point; call this limit point Z. Since C is closed, Z belongs to C. Since / is 
continuous, it follows from (2.6) that 

/ (Z) = M. 

This proves that/ attains its maximum value. 

Since every / has a maximum, it follows that -/ has a maximum, so / has a 
minimum. This completes the proof of the Extreme Value Theorem. □ 

Uniform continuity. The concept of uniform continuity is basic: 

Definition 2.18. Denote by S' a subset of W 1 . A function F : S —> R m is uni¬ 
formly continuous on S if for every tolerance e > 0, there is a precision S > 0 
so that if X and Z in S are within S of each other, F(X) and F(Z) are within e 
of each other. That is, 

if ||X-Z||<d then ||F(X)-F(Z)|| < e. 


Uniform continuity on S implies continuity at every point of S. Surprisingly a 
converse is true if S is closed and bounded. 


Theorem 2.12. A continuous function F : C c R n —> R m on a closed and 
bounded set C is uniformly continuous on C. 


A proof of this theorem is outlined in Problem 2.40. 

Example 2.33. Let f(x,y) = xy on the square 0 < v < 1, 0 < y < 1. According 
to Theorem 2.12/ is uniformly continuous. □ 


Problems 

2.24. Rewrite the proof of Theorem 2.5, part (a), by justifying the following steps. 

(a) Let X be in D and let e > 0 be a tolerance. Show that there is a 6 > 0 so that if 
l|X —Y|| < 6 then |f(X)-/(Y)| < if and |*(X)-g(Y)| < ±e. 

(b) Show that 

/(X) -/(¥) + g(X) - *(Y)| < /(X) -/(Y)| + |*(X) - *001 < 6. 

(c) Show that this proves / + g is continuous at X. 

2.25. Suppose F(xi,...,x„) = (fi(x\,...,x n ),f 2 (^ 1 ,...,x n )) is a continuous function 
from R n to R 2 , and g is a continuous function from R 2 to R. Prove the following. 
(This is an alternate way to prove parts (a) and (b) of Theorem 2.5). 
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(a) The composite goF is continuous. 

(b) The function g(x,y) = x + y is continuous. 

(c) Suppose/i and /2 are continuous functions from R n to R. Use parts (a) and (b) to 
show that/i +/2 is continuous. 

(d) Suppose /1 and /2 are continuous functions from R n to R. Use part (a) and some 
function g to show that the product/ 1/2 is continuous. 

2.26. Show that the function 


f(x,y,z) 


sin(x 2 + v 2 ) 
e z+ y 


is continuous at all points ( x,y,z )• 


2.27. The slope of the graph of cos(cx) lies in the interval [-c,c]. Fill in the missing 
numbers. 

(a) if \x-a\< (?) then | cos(2v) - cos(2^)| < e. 

(b) if \y - b\ < S then |cos(3y) - cos(3Z?)| < (?). 

(c) if\\(x,y)-(a,b)\\ < (?) then |cos(2v)cos(3y) -cos(2a)cos(3&)| < e. 


2.28. According to the single variable Intermediate Value Theorem a continuous 
function / on a closed interval assumes every value between its values at the end¬ 
points. Suppose now D c R n is a set in which any two points can be joined by a curve 
(D is connected) and let/ : D —> R n be continuous. Justify the following steps to 
show that if y is a number between two values off then y is a value off. 

Suppose A and B are points of D and y a number with 

/(A) < y <f (B). 


(a) Show there is a curve X(t), a < t < b, in D with X(a) = A, X(b) = B. 

(b) The composite/ oX is continuous on [a,b]. 

(c) / (X(a)) <y <f (X(b)). 

(d) There is a number t\ in [a,b\ such that/(X(?i)) = y. 

2.29. A function/ from R 2 to R is continuous in the disk x 2 +y 2 < 1, the maximum 
value off is 10 and/(l,0) = 10, /(0, \) = -10. Which are true? 

(a) f(x,y) = 0 at some point of the disk. 

(b) / has a minimum value on the disk. 

(c) -10 is the minimum value off on the disk. 

(d) If x 2 + (y — \) 2 is small enough then f(x,y) < -9.98. 

2.30. A function/ from R 3 to R is continuous on an open set that contains the cube 

where 0 <v< 1 , 0 < y < l, 0 <z< 1 , the maximum value off on the cube is 10 and 
/(0, 1) = 5. Which are true? 

(a) / has a minimum value on the cube. 

(b) /( 0 , 0 , 0 ) is the minimum value off on the cube. 
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(c ) f(x,y,z) = 2n at some point in the cube. 

(d) / could be 10 at two points. 

(e) If x 2 + (y - \) 2 + (z - l) 2 is small enough then f(x,y,z) > 4.98. 

2 . 31 . A function/ from R 3 to R is continuous on R 3 . Show that these functions are 
continuous. 

(a) 10 + xf(x,y,z) onR 3 

(b ) f(x,x,y) on R 2 

(c) f(x\X 2 ,X 2 X 3 ,X 3 X 4 ) on R 4 

2 . 32 . A continuous function/ : (a,b) —> R on an open interval does not necessarily 
have a maximum or minimum value. 

(a) Give an example of a continuous function / : (0,1) —> R with arbitrarily large 
values. 

(b) Give an example of a continuous function ^ : (0,1) —> IR that is bounded but does 
not attain a maximum or minimum value. 

2 . 33 . Sketch the graph off(x\,X 2 ) = X 2 on x 2 +x^ < 2 and find the maximum value 


of/. 


2 . 34 . Which of the following sets are bounded? 

(a) The points of R 3 where x 2 +y 2 + z 2 = 25. 

(b) The points of R 3 where x 2 + y 2 - z 2 = 1. 

(c) The points of R 2 where x< \ and y < 1. 

2 . 35 . Show that the set x 2 + y 2 < 1 in R 3 is an open set. 

2 . 36 . Let S be the set R 2 with the origin removed. Show that 0 is a boundary point 
of 5. 

2 . 37 . Let T be the triangular region in R 2 defined by v > 0, y > 0, and v + y < 1. 

(a) Describe the boundary of T. 

(b) Show that the point (.0001, .9998) is an interior point of T. 

2 . 38 . State the domain of each function. Is the domain closed? bounded? Is / con¬ 
tinuous? Does / have a maximum? a minimum? 

(a)/(X) = e -||X|12 where X is in R 2 
(b = (47rr)- 1/2 e- j:2/4f 

(c) /(X,0 = (47it)“" /2 e- ||X||2/4f where X is in R". 

(d) /(xi,X 2 ,X 3 ,X 4 ,X 5 ) = - 



2.39. Consider the linear function 
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(a) Find a number c so that ||F(X)|| < c||X||. 

(b) Find a number d so that ||F(X) - F(Y)|| < d\\X - Y||. 

(c) Is F uniformly continuous? 

2.40. In this problem we prove Theorem 2.12. Suppose F : C c R n —> R m is contin¬ 
uous on a closed and bounded set C. Either F is uniformly continuous on C, or it is 
not. Justify the following statements that show that the statement that F is not uni¬ 
formly continuous on C leads to a contradiction, hence F is uniformly continuous. 

(a) Since/ is not uniformly continuous there is some tolerance e and a sequence of 
pairs of points X*, in C, k- 1,2,3,... such that ||X^ - Y^|| < \ and 

||F(X*)-F(Y*)||>e. 

(b) As we saw in the proof of Theorem 2.11, the sequence X^ must have a subse¬ 
quence Xfc. that converges to a point X in C. Then ||X&. - Y^.|| < ^ and since F is 
continuous 

lim F(X*.) = F(X). 

k[ —*°o 

(c) Use the triangle inequality 

l|X-YfcJI < ||X-X^.|| + ||Xfc f - Y^.|| 

to show that the Y&. also converge to X. 

(d) The sequences F(X^) and F(Y^) both converge to F(X). 

(e) That contradicts ||F(X^.) -F(Y*!)|| > 6. 

(f) F is uniformly continuous on C. 

2.41. Let C be a vector in R n and let X and Y be vectors in R n . Use the Cauchy- 
Schwarz inequality 

|A-B|<||A||||B|| 


to prove: 

(a) the function/(X) = C • X from R n to R is uniformly continuous, 

(b) the function g(X, Y) = X • Y from R 2 ^ to R is continuous. 

2.42. Let/(X) = ||X|| and g(X) = ||X|| 2 , for X in R". 

(a) Give examples of points X and Y one unit apart from each other, such that 

|g(Y)-g(X)|> 10 60 . 

(b) Show that/ is uniformly continuous. 

(c) Show that g is not uniformly continuous. 

2.43. Consider the function f(X) = —in the set D in R n where ||X|| > 2. Use the 

||X|| 

identity 
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J_1 _ l|Y|| — ||X|| 

l|X|| ||Y|| ||X||||Y|| 


to show that / is uniformly continuous on D. 

2.44. Suppose F is a function from R n to R m with the following property. For 
every sequence Xi,X 2 ,X 3 ,... that converges to X in the domain of F, the sequence 
F(Xi),F(X 2 ),F(X 3 ), ... converges to F(X). Justify the following steps to show that 
F is continuous. 

(a) If F is not continuous at a point A, then there is some e > 0 so that for every 8 
there is a point B with 


||A-B|| < 8 and ||F(A)-F(B)|| > e. 


(b) If F is not continuous at a point A, then there is some e > 0 so that for every 
integer k > 0 there is a point with 


HA-X*||<I and ||F(A)-F(X*)||>e. 


(c) If F is not continuous at a point A, then there is a sequence Xi,X 2 ,X 3 ,... con¬ 
verging to A such that the sequence F(Xi),F(X 2 ),F(X 3 ), ... does not converge to 
F(A). 


2.3 Other coordinate systems 

Polar coordinates. It is often convenient to use polar coordinates (r, 6) rather than 
rectangular coordinates (x,y) to describe a curve or a region in the plane. Figure 2.23 
shows the coordinates of a point in a plane given in terms of rectangular coordinates 
and polar coordinates, where we usually take r > 0 and 0 <6 <2n. 


y axis 



0 


polar axis, x axis 


Fig. 2.23 Polar and rectangular coordinates 
The coordinates are related by 


x = rcosO 
y - r sin6. 


(2.7) 
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and 

r = ^ jx 2 + y 2 (2.8) 

Let F : R 2 —> R 2 be the function defined by 

F (r,6) = (rcos#,rsin$). 

The component functions of F are x = rcos#, y = rsin6. F is called the polar 
coordinate mapping. Let’s see how regions in the (r,0) plane are mapped to the 
(x,y) plane. See Figure 2.24. If we restrict F to a rectangular region 0 < a < r < b, 
0 < a < 6 < J3 < 2n, then F is one to one. Three such regions are indicated in the 
figure. 


27r 


tt/2 
7t/4 
0 

0 12 3 

Fig. 2.24 The polar coordinate mapping. 



Cylindrical coordinates. An alternative to Cartesian coordinates (x,y,z) in space 
uses polar coordinates in the x,y plane and retains the z coordinate. 


Definition 2.19. The cylindrical coordinates (r,0,z) that correspond to Carte¬ 
sian coordinates (x 9 y,z) are related by 

x = rcosO 
y = rsin# 

z = z 

r > 0 and 0 < 0 < 2n. 


Example 2.34. The point with Cartesian coordinates (v,y,z) = (V2, V2,3) has 
cylindrical coordinates (r, 6, z) = (2, |, 3). □ 
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Fig. 2.25 The region in Example 2.35. 


Cylindrical coordinates can simplify the description of some regions in space. 
Example 2.35. Let D be the region described in cylindrical coordinates by 

1 < r < 2, 0<#<7r, 3 < z < 4. 

A sketch of this region is shown in Figure 2.25. Using Cartesian coordinates 
D is described by 


0<y, 1 <x 2 +y 2 <4, 3 < z < 4. 


□ 

If we think of the function 

F(r,0,z) = (rcos#,rsin#,z) 

from R 3 to R 3 , we can see that a solid rectangular bar in (r,0,z) space, described by 
the inequalities in Example 2.35, is mapped to the region D there. If we extend the 
bar to 0 < 6 < 2n, then we get a full ring. Describing a region as the range of some 
mapping will be useful later when we study the integral. 

Spherical coordinates. Another way to describe the location of a point (x,y,z) is 
to denote by p the distance between the point and the origin, let 0 be the angle 
between the positive z axis and the line through (0,0,0) and (x,y,z), and let 6 be the 
angle between the plane containing (x,y,z) and the z axis, and the x,z plane. See 
Figure 2.26. 
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Definition 2.20. The spherical coordinates (p , 0, 6) that correspond to the 
Cartesian coordinates (x 9 y,z) are related by 

x = psin0cos# 
y = p sirup sin# 
z - pcos 0. 

0 < p, 0 < 0 < 7T, 0 < 0 < 27T. 



Fig. 2.26 The spherical and cylindrical coordinate systems. 

Example 2.36. The set of points in space that lie on a sphere of radius 3 cen¬ 
tered at the origin satisfy 

0 2 +;y 2 + z 2 = 3. 

Using spherical coordinates we describe these points by 

p = 3 

and 0 and 6 take their full range: 0 <6 < 2tt, 0 < 0 < n. □ 

Example 2.37. To describe the set of points on the circular cone surface 

z = yj , x 2 +y 2 

shown in Figure 2.27 we note that every point on the cone has the same angle 
0. To find 0 we look at the intersection of the cone with the plane y = 0. 
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At such points we get z = Vx 2 = \x\, so 0 = and p and 0 are unrestricted. 
Another approach is to substitute the formulas x = p sin 0cos 6 , y = p sin0sin0 
and z = p cos0 into the equation for the cone: 


p cos 0 = yp 2 sin 2 0 cos 2 0 + p 2 sin 2 0 sin 2 6 
= p sirup. 

Since 0 < 0 < 7r this is satisfied only by 0 = |. □ 

Example 2.38. Let D be the solid region of points that satisfy 

x 2 +y 2 +z 2 <9, x 2 +y 2 +z 2 > 1, z>0 

shown in Figure 2.28. The surface of the inner hemisphere is sketched with 
dots to indicate that the points on that surface are not included. Region D is 
described in spherical coordinates as 

1 <p < 3, O<0<|. 


□ 



Fig. 2.28 The region in Example 2.38. 
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Problems 


2.45. Consider the polar coordinate mapping illustrated in Figure 2.24. Sketch a 
region in the r, 6 plane that corresponds to the upper half plane y > 0. 

2.46. Sketch the region in the x,y plane that corresponds to the polar coordinate 
rectangle 1 < r < 2, 0 < # < 7r. Find the polar coordinates of the point (. x,y ) = (0,1.5). 

2.47. Use equations and inequalities to describe the following sets in polar coordi¬ 
nates. 

(a) The open unit disk x 2 + y 2 < 1. 

(b) The first quadrant x > 0, y > 0. 


2.48. Consider the set of points in the x,y plane whose polar coordinates satisfy 

1 


0 <6 < 2n, r = 


2 +sin# 

Show that this is a bounded set in the x,y plane. 


2.49. Let 0 < b < a and consider the set of points in the plane whose polar coordi¬ 
nates r, 6 satisfy 

1 

r — -. 

a + #cos# 

(a) Show that the x,y coordinates of the points satisfy 

1 = a yjx 2 +y 2 + bx. 

(b) Show that the equation 1 - a y\x 2 +y 2 + bx is the equation of an ellipse by com¬ 
pleting a square and getting the equation into the form 

( x-a ) 2 (y~P ) 2 

c 2 d 2 ‘ 

2.50. Make a sketch to show the image of a rectangle ro < r < ro + h, 0 <6 <2n 
under the polar coordinate mapping. Find the area of the image rectangle. Show that 
the ratio 

area(image) 

area(rectangle) 

tends to ro as h tends to 0. 


2.51. Match the formulas (i-iv) in spherical coordinates with the descriptions (a-d) 
of sets in R 3 . Note that if we don’t specify any restrictions on a coordinate, then the 
usual ones apply: p > 0, 0 < 0 < n, 0 < 6 < 2n. 

(i) p < 1 (ii) 0 = y (hi) 0 <6 <n (iv) p > 1 and | < 0 
(a) a half-space 
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(b) an open ball 

(c) a cone surface 

(d) a half-space with points removed that were within one unit of the origin 

2.52. Let D be the region in R 3 where the spherical coordinate p is restricted: 

2 <p < 4, 

and let P = (0,0,1), Q = (7,0,0), R = (3,0,0) in Cartesian coordinates. 

(a) Describe the region D. 

(b) Is R in D1 

(c) There is a point A in D that is closest to P, and a point B in D that is farthest from 
Q. Use a sketch to find A and B. 

2.53. Consider functions 

s\(p, (p,6) = e _p , S2(p,(f>,0) = (1 -p)pe _p , d$(p,(p,6) = (3 cos 1 (p- 1 )p 2 e _p , 

of the type used to describe orbitals of an electron in an atom. (See also Section 9.5.) 

(a) What is the maximum value of s\, and what do the level sets of s\ look like for 
values less than the maximum? 

(b) Which of the three functions is zero on some sphere centered at the origin? 

(c) What are the limits of s\, S 2 , and J 3 as p tends to infinity? 

(d) J 3 is nonnegative on the z axis and in a region enclosed by a double cone sur¬ 
rounding the z axis. Sketch the region. 

(e) Is J 3 positive or negative outside the region of part (d)? 

2.54. For a function that is a product f(x,y) = g(x)h(y) of nonnegative functions g 
and h , show that the maximum off is the product of the maximum values of g and 
of h, if these exist. Use this idea in spherical coordinates to find the locations of the 
maximum values of 

d%(p,(p,Q) = (3 cos 2 (p- 1 ) 2 p 4 e -2p . 

2.55. We outline a proof of the Fundamental Theorem of Algebra: For every poly¬ 
nomial with complex coefficients 


p(z)=po+piz+P2Z 2 + ---+p n z n , (n > 0) 


there is a complex number z where p(z) = 0 . 

Recall that complex numbers z = x + iy can be identified with points (x,y) in R 2 , 
together with the multiplication 

(x + i y)(u + iv) = xu - vy + i (yu + xv). 

Thus polynomials with complex inputs can be viewed as functions from R 2 to R 2 . 
Prove the following statements. 
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(a) Multiplication is continuous, so z 2 , z 3 , etc., are continuous functions, so polyno¬ 
mials are continuous functions. 

(b) Complex numbers have square roots, cube roots, fourth roots, etc: use the fact 
that z = (x,y) can be written r(cos# + i sin 6) in polar coordinates, and that multi¬ 
plication is given by 

ri(cos#i +isin 0 i)r 2 (cos #2 + isin# 2 ) = nJ" 2 (cos( 0 i + # 2 ) + isin(#i +62))- 

(c) The absolute value |z| = ||(v,y)|| is a continuous function of x and y and, using 
part (b), \zw\ = |z||w|. 

(d) |p(z)| tends to infinity as |z| tends to infinity. To see this, first use the triangle 
inequality to show that for |z| > 1 


\Pn-lZ H 1 +'"+Pol <P\Z\" 1 


for some number P; second use the triangle inequality 

\P0 + ■ ■ ■ +PnZ n | > \p n Z n | - \Pn-\Z n ~ X + ■■■ +A)I 


to show that for |z| > 1, \p(z)\ > \p n \\z\ n - P\z\ n ~ l , that tends to infinity as |z| tends 
to infinity. 

(e) Now a proof by contradiction: If a polynomial p does not have a root then the 
function 


f(z) = 


1 

\p(z)\ 


is a continuous function from R 2 to R that tends to zero as |z| tends to infinity. 
Such a function/ has a maximum value at some number a. Therefore |p(z)| has 
a minimum value \p(a)\ = mi^O. 

(f) For every polynomial q(z) of degree n and every number a , q{z) can be expressed 
as a polynomial in z-a. 


q(z) = q(a) + q\a)(z - a) + q"(a) 


(z-fl ) 2 

2 ! 


+ --- + q in \a ) 


(z-a) n 

n\ 


(g) Use part (f) to express 


p(z) = p{a) + c(z - a) k h— 

where c ± 0 and the dots are powers of z~a greater than k. According to part (b) 
there is a &-th root h\ 

hk = _pM. 

c 

Then use 

p(a + eh) = p(a)( 1 -e k )-\— 

where the dots are powers of e greater then k. This shows that 
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| p(a + eh )| < m( 1 - e*) 4— < m 
for e small, a contradiction. 

2.56. Suppose two points in R 3 have cylindrical coordinates 
Show that the distance between the points is given by 

^ r l +r 2~ 2rir2 -0i) + (Z2- Z \) 2 • 

2.57. Suppose two points on the unit sphere centered at the origin in R 3 have spheri¬ 
cal coordinates (1, 01 , 6 \), (1, 02 , 02 ) • Show that the dot product of the points is given 
by 

cos (02 - 0 i)-sin 02 sin 0 i (1 -cos (02 ~ 0 i))- 



Chapter 3 

Differentiation 


Abstract In this chapter we introduce the notion of derivative of functions of sev¬ 
eral variables. We start with functions of two variables and then extend to several 
variables. By using vector and matrix notation we find that many of the concepts 
and results look familiar. 


3.1 Differentiable functions 

Recall that for a function / of a single variable x, we say that / is differentiable at a 
if / is locally linear at a. That is, there is a constant m so that for all h sufficiently 
close to zero the change in / at a , 


f(a + h)-f(a). 


is well approximated by mh. By “well approximated” we mean that the difference 
between f(a + h)~ f(a ) and mh is small compared to h when h is small. That is, 


(f(a + h)-f(a))-mh 
h 


tends to zero as h tends to zero. In this case we say / is differentiable at a and we 
write 


lim 

h—>{) 


f(a + h)-f(a) 
h 


= m. 


The number m is called the derivative of / at a , denoted f'(a ), and the linear approx¬ 
imation of / at a is 

f(a) + f'(a)(x-a). 

Now let’s extend the notion of “local linearity” to functions from R 2 to R. First 
we recall from Theorem 1.2 that a linear function € from R 2 to R is a function of the 
form 


© Springer International Publishing AG 2017 

P. D. Lax and M. S. Terrell, Multivariable Calculus with Applications, 
Undergraduate Texts in Mathematics, https://doi.org/10.1007/978-3-319-74073-7_3 


103 




104 


3 Differentiation 


£(h,k) = ph + qk , (3.1) 

where p and q are some numbers. We recall Definition 1.5 of the norm of a vector 
ii(/a)ii = V&2+R 

Definition 3.1. A function / defined in an open disk in R 2 centered at (a, b) is 
differentiable at ( a , Z?) if 

f(a + h,b + k)~ f(a , Z?) 

can be well approximated by a linear function £ in the following sense: 

(f(a + h,b + k)-f(a,b))-€(h,k) 

W(hM\ ( j 

tends to zero as ||(Zi,&)|| tends to zero. 


We call 

L(x,y ) = f(a , b) + £(x -a,y-b) 

the linear approximation of f(x,y ) at (a,b). 

Definition 3.1 can be rewritten in vector language. Let A = (a,b) and H = (h,k). 

Definition 3.2. (Vector notation version) A function / from R 2 to R defined 
on an open disk centered at A is differentiable at A if /(A + H) - /(A) can be 
well approximated by a linear function £ in the following sense: 

(/(A+ H) - /(A)) -1(H) 

mu 

tends to zero as ||H|| tends to zero. 


Theorem 3.1. If a function f from R 2 to R is differentiable at A then f is 
continuous at A. 


Proof By Definition 3.2, 


/(A + H) - /(A) - £ (H) 

tends to zero as ||H|| tends to zero. Since ^(H) tends to zero as ||H|| tends to zero, so 
does /(A + H) - /(A). □ 

Next we relate the numbers p and q in the linear function £(h,k ) = ph + qk in 
formula (3.2) to the function /. 
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Suppose / is differentiable at ( a,b ) and £(h,k) = ph + qk. Set k- 0 in Defini¬ 
tion 3.1; then 

lim f(a + h,b)-f(a,b)-ph _ Q 
h^O h 

so f(x, b ) is a differentiable function of a single variable v at a, and 

/(a + h,b)~ f(a , Z?) 


p = lim : 

/z—>o 


Zi 


The number is called the partial derivative of / with respect to x at (< a , Z?) and is 
denoted 

^f(a,b) or f x (a,b). 

OX 

Thus ^/-(a,b) is found by holding y equal to the constant b and differentiating f(x, b) 
ox 

with respect to v at a. Similarly if we let h = 0 in Definition 3.1 we see that if / is 
differentiable at ( a,b ) then f(a,y) is a differentiable function of y at b and 


l . f(a , b + k) - /(a, b)-qk 

lim —-——-—— = 0 

o & 


so that 


= lim 


f(a,b + k)-f(a,b ) 
k 


The number g is called the partial derivative of / with respect to y at (a, Z?) and is 
denoted 

or fy(a,b). 

This shows that if / is differentiable at (a,b) then it has partial derivatives f x (a,b ) 
and f y (a,b ) there, and the linear approximation of / at (a,Z?) is 


£(*,30 = f(a, *) + fx(a, b)(x - a) + / y (fl, Z?)(y - Z?). 


The rules for finding partial derivatives follow from the rules for ordinary differ¬ 
entiation. 

(a) (f+ g)x = fx + gx and (f + g)y = fy+gy 

(b) (fg)x = f x g + fgx and (fg) y = f y g + fg y 

(c) (?T=-^ and (?) y 

Example 3.1. We show that f(x,y ) = xy 2 is differentiable at (1,3). First we find 
f x ( 1,3) and f y (l,3). Holding y fixed and differentiating / with respect to v we 
get f x = y 2 , f x ( 1,3) = 9. Holding v fixed and differentiating / with respect to 
y we get f y = 2xy, f y ( 1,3) = 6. Now that we have p and q we check that / is 
locally linear at (1,3), where £(h,k) = 9h + 6k. 
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f(l+h,3 + k)-f (1,3)-f(h,k) _ (1 + h)(3 2 +6k + k 2 ) - (1)(3) 2 - (9h + 6k) 

P,*)ll “ P,*)ll 


k 2 + 6 hk + hk 2 

P,*)ll ' 

By the triangle inequality | k 2 + 6 hk + hk 2 \ < k 2 + 6\hk\ + \h\k 2 . Since we are 
taking the limit as ||(/z,&)|| tends to zero we can restrict attention to where 
\\(h,k)\\ < 1, and there \h\<\. We get 

| k 2 + 6 hk + hk 2 \ < (1 + \h\)k 2 + 6\hk\ < 2 k 2 + 6\hk\. 

Since ( h ± k) 2 = h 2 + k 2 ± 2hk > 0 it follows that 2\hk\ <h 2 +k 2 . Therefore if 

||(/i,&)|| < 1 we have 

\k 2 + 6hk + hk 2 \ ^ 2k 2 + 6\hk\ ^ 2k 2 + 3h 2 + 3k 2 ^ 5 (h 2 +k 2 ) _ / 7 ? 

||(ft,*)|| - |KA,*)H - mM “ ll(A,*)ll V + ' 

Therefore as ||(/i,^)|| tends to zero 

f(l + h,3 + k)-f (1,3)-£(h,k) 

M 

tends to zero, and f(x,y ) = xy 2 is differentiable at (1,3). □ 

The next example shows that the existence of partial derivatives, while necessary 
for differentiability at a point, is not sufficient. 

Example 3.2. Define a function / as 

f(x,y) = \x + y\-\x-y\. (3.3) 

The single variable functions, 

fix, 0) = |jc + 0|-|jc-0| = 0, W,y) = |0 + y|-|0-y| = 0, 

are both constant and therefore differentiable at (0,0), and 
fx( 0,0) = 0, f y ( 0,0) = 0. 

Next we show that 

/(0 + h, 0 + k) - /(0,0) - €(h, k ) 

M 

does not tend to zero as ||(/i,&)|| tends to zero. Take k-h\ then 

/(0 + h, 0 + h) - m 0) ~ (Oh + Oh) _ \2h\-\0\-0-(0h + 0k) _ 2y_ 
yfjfi+k 2 yflh 2 V2 
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does not tend to zero as ||(/i,/i)|| tends to zero. Therefore / is not differentiable 
at (0,0). □ 

Next we show that if / has continuous partial derivatives on an open set contain¬ 
ing ( a,b ) then / is differentiable at (< a,b ). We use the following theorem. 


Theorem 3.2. A Mean Value Theorem. Let f be a function from R 2 to R 
whose partial derivatives f x and f y exist on an open set containing ( a,b ). 
Then for each ( h,k) with ||(/i,&)|| sufficiently small there are numbers h' and k' 
where a + h' lies between a and a + h, and b + k' lies between b and b + k, such 
that 

f (a + h,b + k)- f(a,b) = hf x (a + h',b + k) + kf y (a,b + k'). (3.4) 


(a, b + k) 


(cl + h, b + k) 


n 

(a, 6) 


(a + h,b + k) 



(a + h',b + k) 
(a, b + k') 


Fig. 3.1 Left: a small rectangle in U, in Theorem 3.2. Right: points used in the proof. 


Proof Write 

f(a + h, b + k) - f(a , b) = f(a + h, b + k) - f(a , b + k) + f(a , b + k)- f(a, b). 

For \h\ and \k\ sufficiently small all the points on the sides of a small rectangle (see 
Figure 3.1) are in U and f(x,b + k) is differentiable on the closed interval from a to 
a + h. We apply the Mean Value Theorem for the single variable function f(x , b + k) 
and conclude that there is a number a + h' between a and a + h for which 

f(a + h, b + k) - f(a , b + k) = hf x (a + h',b + k). 

Similarly f(a,y) is differentiable on the interval between b and b + k , and by the 
Mean Value Theorem for single variable functions there is a number b + k' between 
b and b + k for which 


f(a , b + k)- f(a , b) = kf y (a , b + k'). 


Add these two equations, and we obtain formula (3.4) to complete the proof. 


□ 
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Theorem 3.3. If the partial derivatives of f are continuous in an open set 
containing (« a , b) then f is differentiable at (a, b). 


Proof Using formula (3.4) of Theorem 3.2 

f(a + h,b + k)- f(a , b) - ( hf x (a , b) + kf y (a , b)) 

V/z 2 + k 2 

hf x (a + h' ,b + k) + kf y (a , & + &')) - ( hf x (a , Z?) + /^(a, £>)) 

Vft 2 + & 2 

h(f x (a + h',b + k)-f x (a,b)) + k(f y (a,b + k')-f y (a,b)) 

where /z' is between 0 and /z and k' is between 0 and By the triangle inequality the 
absolute value of the last expression is less than or equal to 

7 jp , !/,(* + A', ft + *) - /,(*, ft)| + . f „ k(a, ft + ) - f y (a, b)\. 

Vft 2 +£ 21 1 Vft 2 +£ 2 ' 1 

Since ^ and ^ are less than or equal to 1, and f x and ^ are con- 

V/z 2 + & 2 V/z 2 + k 2 

tinuous, each of the terms in this expression tends to zero as ||(/i,&)|| tends to zero. 
This proves that (3.5) tends to zero as ||(/i,&)|| tends to zero. This shows that / is 
differentiable at (a,b). □ 

Example 3.3. Let f(x,y ) = y + sin(xy) + sinh v. Use the linear approximation of 
/ at (0,1) to approximate /(. 1, .9). The linear approximation of / at (0,1) is 

Ux,y) = m 1) + fx(0, l)(x - 0 ) + f y { 0 , l)(y - 1). 


We calculate 

/(0,1) = 1 + sin(0) + sinh(0) = 1 + 0 + \ (e° - e“°) = 1 

f x (x,y) = cos(xv)) ! + cosh x, f x (0, 1) = cos(0)l + |(e° + e _0 ) = 2 

f y (x,y) = 1 + cos(xy)v, f y ( 0,1) = 1 +cos(0)0 = 1. 

Therefore L(x,y) = 1 + 2(x - 0) + l(y - 1) and 


L(.l,.9) = 1+2(.1) + 1(.9-1) = 1 + .2-.1 = 1.1 


is a good approximation of /(. 1,.9) = 1.09005.... □ 

Partial derivatives and differentiability, F : R n —> R m . We extend the definitions 
of differentiability, local linearity, and partial derivative. For functions / : R n —> R 
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we define partial derivatives by differentiating with respect to the i -th variable while 
holding the others fixed: 

— = lim---. 

OXi h-+ o h 

Let F be a function R n —> R m whose component functions 

¥(x\,...,x n ) = (f\(x\,...,x n ),fi(x\,...,x n ),...,f m (xi,...,x n )) 

are differentiable. Each function fi from R n —> R has ft partial derivatives denoted 
or . We arrange the partial derivatives in an m by n matrix 



i<A) - 



t<A> 

i<A) - 


(3.6) 

t(A) 

&*) - 



at A. 





D¥(A) = 


Using vectors and matrices we can express the definition of differentiability of F 
at A. 


Definition 3.3. A function F from W 1 to R m defined on an open set U contain¬ 
ing A is differentiable at A if F(A + H) - F(A) can be well approximated by a 
linear function La in the sense that 

||F(A + H) - F(A) - L a (H)|| 
l|H|| 

tends to zero as ||H|| tends to zero. 


By an argument similar to the one we gave for /:R 2 ^Rwe can show that if F is 
differentiable at A then F is continuous at A. We can also show that the component 
functions have partial derivatives, and that 

L a (H) = DF(A)H 

where DF(A) is the matrix (3.6) of partial derivatives. We ask you to justify the 
steps to show this in Problem 3.9. 

Example 3.4. Let f(x,y,z) = x 2 sin(yz). The partial derivatives of / are 
fx = 2xsin (yz), f y = zx 2 cos(jz), /, = jx 2 cos(yz), 


and 
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fx( l,f,2) = 0, f y (l, |,2) = -2, / z (l, |,2) = 

Therefore 

D/(l,f,2)) = [0 -2 -§]. 


□ 


Example 3.5. Let F(v,y) = (x 2 + y 2 ,x, -y 3 ). Find Z)F(l,-2). 


24 

dx 

d/2 

dx 

% 

dx 


= 2x, 
= 1, 
= 0 , 


f=^ 

f = o, 

f =-3/- 


At the point (1,-2) we have 


DF(l,-2) 


2 -4 
1 0 
0 -12 


□ 


Definition 3.4. For n = m the matrix derivative (3.6) of F at A is a square 
matrix. Its determinant is called the Jacobian of F at A and is denoted as 
/F(A): 

7F(A) = detDF(A) (3.7) 


Just as the derivative f'(a) can be thought of as a local stretching factor by /, 
- as x - a tends to zero, the geometric meaning of the Jacobian is local 


x-a 

magnification of volume by F. That is, denote by B r (A) the ball of radius r centered 
at A, and by C r (A) its image under the mapping F. The ratio 


Vol(C r (A)) 
Vol (2? r (A)) 


tends to |/F(A)| as r tends to zero. 

Example 3.6. Let F(v,y) = (x 2 +y,y 3 + xy). Find the Jacobian of F at (1,2), and 
interpret it as a local magnification of area. 


DF(*,y) 


2x 1 
y 3 y 2 +x 


DF(1,2) 


2 1 
2 13 


/F(l,2) = det 


2 1 
2 13 


= 24. 


The area of the image F(i? r ) of a small disk B r of radius r centered at (1,2) is 
about 24 times as large as the area of B r . □ 

If a function X from R to R n is differentiable at t , then by definition 

X(t + h)-X(t) v//x 


h 
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tends to zero as h tends to zero. That means X'(t) is the limit of secant vectors 
divided by h. Figure 3.2 illustrates that X'(0 is tangent to the curve at X(t). 

If we think of X(t) as the position of a particle at time t , then X'(t) is the velocity 
of the particle at time t and ||X'(0II is its speed. 



Fig. 3.2 ^ + ^ tends to X'(0 as h tends to 0. 


Example 3. 7. Let X(r) = (cos t, sinr, t). The matrix derivative of X at t is 

-sin£ 

DX(t) = X\t)= cos t 
1 


Thinking of X'(t) as the velocity vector at time t we may write 
X\t) = (-sin^cosr, 1). 

Its speed is ||X'(0ll = V s i n 0 2 + ( cos 0 2 + 1 2 = V2. □ 

Definition 3.5. The vector of partial derivatives of a function / : R n —> R is 
denoted 

V/ = (/ x ,,...,/ x J or grad/ 
and is called the gradient of /. 


Example 3.8. Let f(x,y,z) = x 2 sin(yz). By the calculations in Example 3.4, 

V/(v,y,z) = (2vsin(yz),zv 2 cos(yz),yv 2 cos(yz)) 

and V/(l, |,2) = (0,-2,-|). □ 

Using the gradient notation the linear approximation of /(A + H) at A can be 
written using the dot product 

/(A + H)*/(A) + V/(A)-H. 
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Definition 3.6. A function F from an open set U of R n to R m is called contin¬ 
uously differentiable if it has partial derivatives that are continuous functions 
on U. A continuously differentiable function is called a C 1 function. 


In Theorem 3.3 we have shown that a function of two variables that has contin¬ 
uous partial derivatives is differentiable; the analogous theorem holds for functions 
of several variables. 


Theorem 3.4. If F from R n to R m is C l on an open set U then F is differen¬ 
tiable at each point of U. 


In Problem 3.10 we show you how to extend the proof of Theorem 3.3 to prove 
Theorem 3.4. 


Problems 

3.1. Find the partial derivatives. 

(3(x + h) 2 +4y)-(3x 2 +4y) 

(a) lim- 7 - 

h^O h 

(b) lim (3* 2 +4(y + fc))-(3x 2 +4y) 

k —>0 k 

3.2. Let f(x,y) = x 2 + 3 y. Find the linear approximation of f(x,y) near (2,4), and use 
it to estimate /(2.01,4.03). 

3.3. Find the indicated partial derivatives of the functions. 

(a) f x (x,y) and / y (2,0) if f(x,y) = e^ 2 

(b) f(xe y +ye x ) 
dy 

d I d , A 

(c) ^ |cos(at) + —(sm(xy)) I 

3.4. Let f(x,y) = x 2 y 3 . Let (x,y) = (a + u, b + v). Use binomial expansion on the right 
side of 

x 2 y 3 = (a + u) 2 (b + v) 3 
to find the numbers ci, C2, C3 in 

f(x,y) = ci + c 2 (x - a) + c 3 (y - b) + • • • 

where the dots represent polynomials in (x - a) and (y - b) of degree 2 or more. 
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(a) Express c\ in terms of /, a , and b. 

(b) Express C 2 and C 3 in terms of partial derivatives of / at ( a,b ). 

(c) Find functions t and s so that 

f(a + u, b + v) = f(a , Z?) + v) + s{u , v), 

and £ is linear, and s(w,v) is small compared to ( u,v ) as (w,v) tends to ( 0 , 0 ). 

3.5. Two mathematics students are discussing the values of (1 + v + 3 y) 2 for small x 
and y. They find two linear functions i\ and 1 2 to help estimate the values, 

(1 +x + 3;y ) 2 = (1 +x + 3j)(l +x + 3y) 

« ( 1)(1 + v + 3y) = 1 + v + 3y, and 

hix,y) 

(1 + v + 3y ) 2 = 1 + 2x + 6 y + 6xy + x 1 + 9y 2 
« 1 + 2v + 6 y. 


(x,y) 

(.1,-2) 

(.01,.02) 

(l+x + 3 y) 2 
1+x+3 y 

1 + 2x + 6y 




Fill in the table of values, and observe that some linear functions track small 
changes better than others. 

3.6. Consider a linear function b(x,y,z) = ax + by + cz. Show that W is constant. 
Show that when a , b , c are not all zero W is normal to the level set € = 0. 

3.7. Consider two linear functions t(x,y) = x + 2y and m(x,y) = -3b(x,y). Sketch 
level sets l - -1,0,1 and m - -1,0,1. For which function are these more closely 
spaced? Determine the gradient vectors V^(x,y) and Wm(x,y). For which function 
are these vectors longer? 

3.8. Fet X be in R n . Find the gradients. 

(a) V(2||X|| 1/2 ) 

(b) vf-nxir 1 ) 

(c) v|i||X|rj, r* 0 

3.9. Suppose a function F from R n to R m is differentiable at A. Justify the following 
statements that prove 

L a H = DF(A)H, 

that is, the linear function L A in Definition 3.3 is given by the matrix of partial 
derivatives DF(A). 
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(a) There is a matrix C such that La(H) = CH for all H. 

(b) Let Q be the i -th row of C. The fraction 

||F(A + H) - F(A) - L a (H)|| = ||F(A + H) - F(A) - CH|| 
l|H|| ||H|| 

tends to zero as ||H|| tends to zero if and only if each component 

l|H|| 


tends to zero as ||H|| tends to zero. 

(c) Set H = hEj in the /-th component of the numerator to show that the partial 
derivative f^ Xj ( A) exists and is equal to the (/, j) entry of C. 

3.10. Justify the following steps to prove Theorem 3.4, that a function with contin¬ 
uous first partial derivatives is differentiable. In parts (a)-(d) we suppose / : R n —> R 
has continuous first partial derivatives at all points in a ball of radius r centered at 
point P. In parts (e)-(f) we assume the components of F : R" —> R m have continu¬ 
ous first partial derivatives at all points in a ball of radius r centered at point P. Let 
H be a vector with ||H|| < r. 

(a) 


f(? + U)-m = f(p 1 +h 1 ,...,p n + h n )-f(pu...,p n ) 

= f(pi +huP2 + h 2 ...,Pn + h n )-f(pi,P2 + h 2 ,...,Pn + K) 

+ f(pi,p 2 + h 2 ,p3 + h3,...,p n + h n )-f(p\,p 2 ,p3+h3,...,p n + h n ) 

+ ... 

+ f(PUP2,---,Pn-UPn+h n )-f(puP 2 ,---,Pn-UP n )- 
(b) There are numbers 0 < h' t < hi such that 

f(Pu- • •, Pi-u Pi + hi,p i+ 1 + h i+ 1,. ..,p n + h n ) 

Pi-1 , Pi,Pi +1 + hi+i ,..., Pn + hn) 

- hifxt(pi , ■ • •, Pi-l,Pi + h\ , p i+ 1 + h i+ i ,..., Pn + hn). 


(c) /(P + H)-/(P) = ^h i f Xi (pi,...,p i -i,p i + h' i ,p i+ i+h i+ i,...,p n + h n ). 


i= 1 


(d) 


/(P + H)-/(P)-H-V/(P) 

mm 


tends to zero as H tends to 0. 


(e) Given e > 0 there are numbers r,- so that if ||H|| < r,, then 
|/KP + H)-/(P)-V/KP)-H| 


IIHII 


< e. 


(f) Let r be the smallest of r\,...,r m . Then if ||H|| < r. 







3.2 The tangent plane and partial derivatives 


115 


||F(P + H)-F(P)-DF(P)H|| 

m 


< em. 


3.11. Define functions / and g from R 2 to R by 

f(x,y) = cos (x + y), g(x,y) = sin(2 x-y). 
Find the gradients V/ and Vg, and show that 


fx~fy = 0 , g x + 2g y = 0 . 

3.12. Let f\(x,y ) = e x cosy, fi(x,y) = x 2 -y 2 . Find V/i and V/ 2 , and show that 


dx 


(/*) + 


d_ 

dy 


(fy) = 0 


for each of /1 and 

3.13. Let g(x,y) = z ax+b y , where a and b are some numbers. Find the gradient of g. 

3.14. Let a, b , and c be some numbers, and define 

/(x, y , z) = sin(< 2 x + by + cz). 

Let C = (/?, r) be a vector such that ap + bq + cr = 0. Show that C • V/ = 0, that is, 


Pfx + <lfy + r/z = 0. 


3.2 The tangent plane and partial derivatives 

The geometric interpretation of the derivative f'(a ) of a function / of a single vari¬ 
able is the slope of the line 


y = f(a) + f'(a)(x-a) 

tangent to the graph of the function. There is a similar geometric interpretation of the 
partial derivatives of a function of two variables. Suppose / is differentiable at (< a , b ), 
with partial derivatives f x (a,b) and f y (a,b ) at (a,b). The geometric interpretation of 
linear approximation is that the graph of 

z = L(x,y) = f(a , b) + f x (a, b)(x -a) + f y (a , fc)(y - b) 

is the plane tangent to the graph of / at the point (a,b,f(a,b)). Rewriting the equa¬ 
tion 

fx(a, b)(x -a) + f y (a,b)(y-b) + (-l)(z- f(a, b)) = 0 
we see that a normal to the tangent plane is the vector 


(3.8) 
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N = (f x (a,b),f y (a,b),-l). 

If / is differentiable at ( a,b ) we say N is normal to the graph of / at (a,b,f(a,b)). 



Fig. 3.3 A graph partly cut away for clarity at its intersections with planes x = a and y = b. 


Another way to obtain an equation for the plane tangent to the graph of / at (a, b) 
is to intersect the graph of / with the planes x = a and y-b (see Figure 3.3), and use 
the lines tangent to the resulting curves to determine the tangent plane. 

In the plane x - a an equation for the line tangent to the curve at (a,b,f(a,b)) is 

c\(t) = (a,b,f(a,b)) + t( 0,1 ,f y (a,b)). 

Similarly, an equation for the line tangent to the intersection curve in the plane y-b 
is 

c 2 (s) = (a,bJ(a,b)) + s(l,0J x (a,b)). 

As we saw in Section 1.9 a parametric equation for the plane determined by these 
two lines is 


P (s, t ) = (a, b, f(a , b)) + s( 1,0, f x (a, b)) +0,1, f y (a, b)). 
A normal to this plane is the cross product of the two tangent vectors, 


(1,0 ,f x (a, b)) X (0,1, fy(a, b)) = ( -f x (a , b), -f y (a, b), 1) 


which is a normal to the tangent plane given by equation (3.8). 

Example 3.9. Let f(x,y ) = x x ^y 1 ^. 

(a) Find an equation of the plane tangent to the graph of / at (1, -1), and 

(b) Use the linear approximation of / at (1, -1) to estimate /(.9, -1.1). 

We have f y (x,y ) = ^v 1 / 2 y _2/3 and f x (x,y ) = ^v _ 1 / 2 y 1/3 . Since f x and f y are 
continuous near ( 1 ,- 1 ), /is differentiable at ( 1 ,- 1 ). 

/(!,-!) = - 1 . /,(!.-!) = 3 , £( 1 ,- 1 ) = 4 - 


(a) An equation for the plane tangent to the graph of / at (1, -1) is 
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z = /(l,-l)+/ x (l,-l)(x-l)+/ y (l,-l)0;-(-l)) = -l-i(x-l)+i( 3 ;+l), 


or 3v-2y+ 6z + 1 =0. 

(b) To approximate /(.9, -1.1) we use the linear approximation 

L(x,y) = /(1,-1)+ / X (l,-l)(x- 1) + f y (l, — l)(y — (—1)). 

L{. 9,-1.1) = -1 - i(.9- 1)+ i(—1.1 + 1) = -1 + 20 - 35 = --9833..., 
a good approximation of /(.9, -1.1) = -.9793.... 


□ 


Example 3.10. Find all points on the graph of 

f(x,y ) = x 2 -2xy-y 2 + 6x-6y 

where the tangent plane is horizontal. The function / has continuous partial 
derivatives f x = 2x-2y + 6 and f y = -2x - 2y - 6, so / is differentiable at every 
point. A tangent plane to the graph of / at (x,y) is horizontal if f x (x,y ) = 0 
and f y (x,y) = 0. The equations 2x - 2y + 6 = 0 and -2x - 2y - 6 = 0 have the 
solution y = 0 and x = -3. There is one point on the graph with horizontal 
tangent plane, (-3,0, -9). □ 

If two C 1 functions / and g have the same value f(a,b) = g(a,b) and the 
same partial derivatives at (, a , b ), then their graphs have the same tangent plane at 
(a,b,f(a,b)), and we say that the graphs are tangent at that point. 

Example 3.11. To find points of tangency of the graphs of 

f(x,y) = x 2 -2xy-y 2 
g(x,y) = x 2 -3xy + 4x- 16 

and the common tangent plane we look for points where the normal vectors, 
(2x - 2y, —2x - 2y, -1) and (2x - 3y + 4, -3x, -1), are multiples of each other. 
Since the third components are equal this only happens when 

2x-2y = 2x-3y +4 
-2x-2y — -3x, 

which are satisfied by y = 4 and x = 8. We verify that the point (8,4, -16) is 
on both graphs. Therefore there is a common tangent plane 

z = -16 + 8(v-8)-24(y-4) 


at the point (8,4,-16). 


□ 
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Problems 

3.15. Find an equation for the plane tangent to the graph of f(x,y ) = x + y 2 , 

(a) at (x,y) = (0,0), 

(b) at (x,y) = (1,2). 

3.16. Let f(x,y) = ^/l-x 2 -y 2 . 

(a) Sketch the graph of /. 

(b) Find an equation of the plane tangent to the graph of / at (v,y) = ( ^fA, V^5). 



Fig. 3.4 Cross sections of graphs of / and g in Problem 3.17. 


3.17. Let f(x,y) = e and g(v,y) = —^—-. 

(a) Show that f(a , b) = g(a , b ) at every point (a, b) where a 2 + b 2 = 1. 

(b) Find the gradients V/ and Vg. 

(c) Show that the graph of / is tangent to the graph of g at every point (« a , Z?) where 

+ = 1. 

See Figure 3.4 for cross sections of graphs of / and g. 


3.3 The Chain Rule 

Composition. There are many types of functions of several variables so the formula 
for the Chain Rule takes several forms. Here is one of them. 
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Theorem 3.5. Chain Rule 1 (for curves). Let f from R 2 to R be continu¬ 
ously differentiable on an open set U, and let X(t) = (x(t),y(t)) be a differen¬ 
tiable function from an open interval I to U. Then the composition f(x(t),y(t)) 
is a differentiable function from I to R and 


d / / x dfdx Of dy 

d/ W '>.*'»= + 


The right side can be written as the dot product 


= V/(x(0,y(0)-X / (0. 



Fig. 3.5 Line segments used in the proof of Theorem 3.5. 


Proof Because X is continuous, we know that for h sufficiently small, h± 0, the line 
segments from (x(t),y(t)) to (x(t),y(t + h)) and from (x(t),y(t+h)) to ( x(t+h),y(t+h )) 
in Figure 3.5 are in U. Rewrite 

f(x(t + h\y(t + h))-f{x(t\y(t^ 


as 

= {f(x(t + h),y(t + h)) - f(x(t),y(t + h)j) + (f(x(t),y(t + h)) - f(x(t),y(t)j). 

By the Mean Value Theorem for single variable functions there is a number x* 
between x(t + h) and x(t) for which 

f{x(t + h),y(t + hj) - f(x(t),y(t + h)) = f x (x t ,y(t + h)){x{t + h)~ x{t)). 

Similarly there is a number y* between y(t) and y{t + h) so that 

f(x(t),y(t + h)) - f(x(t),y(t)) = f y (x(t),y*)(y(t + h)~ y(t)). 


Now divide by h; we get 
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f(x(t + h),y(t + ft)) - f(x(t),y(t)) 
h 
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fjxu + h),y(t + ft)) - /(Vr),)V + ft)) /(*(?), y(r +ft))-/(*(?), y(Q) 
ft + ft 

_ / c (x«,y(r + ft))(x(f + ft)-x(0) fy(x(t),y t )(y(t + ft) ->'(f)) 

ft + ft 

a / * ^ » x(f + ft)-x(/) ;y(t + ft)-y(0 

As /t tends to zero---and---tend to x (t) and y (t). Since j x 

h h 

and f y are continuous, f x (x*,y(t + h)) tends to f x (x(t),y(t)) and f y (x(t),y *) tends to 
^(x(0,y(0) as ^ tends to zero. Hence 

^f(x(t),y(t)) = f x (x(t),y(t))x'(t) + f y (x(t),y(t))y'(t) = Vf(x(t),y(t))-X'(t). 


□ 

Example 3.12. ForO<^< 1, X(f) = (*(£),y(0) = (t,2t) is the straight path from 
the origin to point (1,2). Let 


f(x,y) = x 2 +y 4 . 

The derivatives of X and / are X'(0 = (1,2), , V/ = (2v,4y 3 ). By the Chain 
Rule, 


^/(X(0) = V/(x(0,y(0)-X'(0 = (2(t),4(2f) 3 ) • (1,2) = 2f + 64r 3 . 
Alternatively, the composite of / and X is 

f(X(t)) = t 2 + 16t 4 , 

and the derivative is j -(t 2 + 16f 4 ) = 2r + 64r 3 as we found using the Chain 
Rule. f □ 

We proved the Chain Rule for curves in R 2 and functions from R 2 to R. The 
analogous theorem can be proved in n dimensions. 

Theorem 3.6. Chain Rule 1 (for curves). Let f from R n to R have continu¬ 
ous partial derivatives on an open set U, and let X(t) = (vi(0,..., x n (t)) be a 
differentiable function from an open interval I in R into U. Then the composite 
f(X(t)) is differentiable on I and 

^/(X(0) = + + = V/(X(0) • X'(0 = Df(X(t))DX(t). 

at ox\ at uxpi Q t 
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Example 3.13. Let X(t) = (cos t, sinf, t), -oo < t < oo be a curve in R 3 that gives 
the position of a particle at time t. Let 

2 2 2 

f{x,y,z) = e-*-y- z 

represent the temperature of the particle at point (x,y,z). Then the composite 

/(X(/)) = e- (1+ ' 2) 

is the temperature of the particle at time t. The rate at which the temperature 
of the particle changes with respect to time is 

4/(X(0) = -2 te~ (l+t2) . 

at 

We calculate derivatives: the velocity of the particle at time t , 

X'(0 = (-sine cos t, 1), 
and the gradient of the temperature at (x,y 9 z) 

Vf(x,y,z ) = -2 e~ x2 ~ y2 ~ z2 (x,y,z). 

Using the Chain Rule we have that the rate at which the temperature of the 
particle changes with respect to time is 

2/ ( x(0) = v/(X(0)-x'(0 

at 

= -2e~( l+t2 \cost,sint,t)-(-sint,cost,l) = -2 ta~^ l+t2 \ 


Directional derivative. The rate at which a function / from R n to R changes as we 
move from a point P along the straight line X(0 = P + 1 \, -h < t < h is 


r /(X(0)-/(X(0)) d^^, 

hl ^-- = 

*-> o t-\) at 


t =o 


By the Chain Rule for curves 

d 
d t 


/(X(r)) = v/(X(0)-x'(f). 


At f = 0 


V/(X(0))-X'(0) = V/(P)-Y. 
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Definition 3.7. Let / be a C 1 function from an open set in R n containing P to 
R, and let V be a unit vector in W 1 . We call 

V/(P)-V 

the directional derivative of / at P in the direction of V, and denote it D\f(P). 


The directional derivative gives insight into the gradient of / at P. Since ||V|| = 1, 
^v/(P) - V/(P)• V = ||V/(P)||cos0 


where 6 is the angle between V/(P) and V. We see that the directional derivative is 
greatest when cos 6 = 1, i.e., when V is in the direction of V/(P). The unit vector in 
the direction of the gradient of / at P is 


V/(P) 

nv/(P)ir 


Therefore the greatest directional rate of change in / at P is in the direction of the 
gradient and is 


V/(P)- 


V/(P) 

IIV/(P)|| 


= IIV/(P)||. 


Example 3.14. Let f(x,y,z ) = x + y 1 + z 4 . Find the direction and magnitude of 
the greatest directional derivative of / at (3,2,1). 


V/ = (l,2y,4z 3 ), V/(3,2,l) = (1,4,4). 


The greatest rate of change in / at (3,2,1) is 

||V/(3,2,1)|| = Vl 2 + 4 2 +4 2 = V33 = 5.744... 


and this occurs in the direction of (1,4,4). □ 

Example 3.15. Let f(x,y,z) = x + y 2 + z 4 . In which directions is the rate of 
change of / at (3,2,1) zero? From Example 3.14, 


V/(3,2,1) = (1,4,4). 


The directional derivative Dy/(3,2,1) is zero if (1,4,4)-V = 0. That is, if 
we move from (3,2,1) in any direction orthogonal to the gradient, the rate of 
change is zero. □ 

The gradient and level sets. Let k be a number, and consider the level set S given 
by f(x,y,z) = k. We assume 

• a point ( a , b , c) is on S , so f(a , b,c) -k 

• f is C 1 on S 
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• Vf(a,b,c)^0 


We show that V/(<z, b , c) is normal to S. 

Suppose X(0 is a differentiable curve in S that goes through (a,b,c) at t = to. 
Then 


mt))=k. 


and 




t=to 



0 . 


By the Chain Rule, 


s /(xw) 


t=t 0 


V/(X(fo))-X'(/o). 


Combining these two expressions we get 


0 = V/(X(to))-X'(r 0 ). 


Therefore V/(a, b , c) is orthogonal to the tangent vector of every differentiable curve 
in S through (a,b,c). This is what we mean when we say that Vf(a,b,c) is normal 
to S at (a,b,c). See Figure 3.6. The Implicit Function Theorem in Section 3.4 will 
show that there is a tangent plane to S at (a, b , c) and that it is the set of points ( x,y, z) 
that satisfy 

Vf(a,b,c)-(x-a 9 y-b,z-c) = 0. (3.9) 


X'(to) 



Fig. 3.6 Vf(a,b,c ) is normal to the k level set f(x,y,z ) -k at ( a,b,c ). 


Example 3.16. Find the plane tangent to the level set 


xyz + z 3 =3 


at the point (1,2,1). Let f(x,y,z) = xyz + z 3 . Then V/ = (yz, xz, xy + 3z 2 ) and 
V/(l,2,1) = (2,1,5). The tangent plane is given by 


(2,1,5) • (v- l,y-2,z- 1) = 0. 


□ 







124 


3 Differentiation 



Fig. 3.7 A sphere and its tangent plane at A, A • (X - A) = 0. See Example 3.17. 


Example 3.1 7. Let S be the sphere of radius R in R 3 centered at the origin, 
satisfying 

x 2 +y 2 + z 2 = R 2 . 

S is the R 2 level set of the function f(x,y,z) = x 2 +y 2 + z 2 . The gradient of / 
is 

Vf(x,y,z) = (2x,2y,2z). 

Using (3.9) the tangent plane at a point (a,b,c) on S has equation 
2(a,b,c ) • (x — a 9 y — b 9 z — c) = 0. 


Let A = (a,b,c) and X = (x 9 y 9 z). Then the equation of the tangent plane (see 
Figure 3.7) to the sphere S can also be written 

A • (X - A) = 0. 

□ 

Let S be the graph of a function / from R 2 to R. We can define a new function 
from R 3 to R by 

g(x,y,z) = f(x,y)-z. 

The level surface g(x,y,z ) = 0 is the same set of points in R 3 as the graph of /. The 
gradient of g is 

dg dg\,df 9f * 

8 'dx’dy’dz' 'dx'dy 9 >' 

Using (3.9) the equation of the plane tangent to the level surface g(x,y,z) = 0 at the 
point (a,b,f(a,b)) is 

Vg(< 2 , b , f(a , b)) • (x - a, y - b, z - f(a , b)) 


= (f x (a, b),f y (a,b) 9 -l)-(x-a,y-b,z- f(a, b)) = 0, 
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which we rewrite as f x (a , b)(x -a) + f y (a , b)(y -b)-(z- f(a , /?)) = 0 as we found 
earlier. 

The Chain Rule also holds if the first function in the composition is a function of 
n variables and the second maps R to R. 


Theorem3.7. Chain Rule 2. Suppose y = f(x i,. ..,x n ) is continuously dif¬ 
ferentiable on an open set U in R” and z - g(y) is continuously differentiable 
on an open interval V in R that contains the range of f. Then the composite 
g(f(x i,.. .,x n )) is continuously differentiable on U and for each Xi 


OXi 


d gdf_ 
dv dxi ’ 


(i = 1 ,...,n). 


Therefore 


D(go/)(X)=-fV/(X). 

dy 


Proof Since / and g are continuously differentiable on U and V their partial deriva¬ 
tives exist and are continuous there. By the Chain Rule for single variable functions 

dz dy _ dz 
d y dxi dxi 


Since — is a product of continuous functions these partial derivatives are continu- 

dxi 

ous on U. □ 

The most general form of the Chain Rule can be proved for compositions of 
continuously differentiable functions X from R k to R m and F from R m to R n . 

Theorem 3.8. Chain Rule. Let U be an open set in R k , and let V be an open 
set in R m . Suppose that X(T) = {x\ (t \,..., 4 ),..., x m (t \,..., 4 )) is continuously 
differentiable on U and F(X) = (y \(xi,..., x m ), ... ,y n (x 1 ,..., x m )) is continu¬ 
ously differentiable on V, and that the range ofX is contained in V. Then the 
composite F(X(T)) is continuously differentiable on U and the derivative of 
the composite is the product of the derivative matrices: 

D( F o X)(T) = DF(X(T))DX(T). 

Componentwise we can write this relation as 

dyj _ dy t dx 1 | dy t dx 2 | | dy t dx m 

dtj dx 1 dtj dx 2 dtj dx m dtj 


Figure 3.8 illustrates the composition of a function from R 3 to M 2 and a function 
from R 2 to R. 
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Proof. Consider each component function f(X(t)) = yi as a function of a single 
variable tj by holding the other 4 constant. We can use the Chain Rule for curves, 
Theorem 3.6, to find the partial derivative of the composite f(X(T)) with respect to 
tj. This gives 

fyi = dyi dxi | dy t dx 2 | | dyj dx m 

dtj dx\ dtj dx'i dtj dx m dtj 

Since we have assumed that the partial derivatives of X and F are continuous, each 
such partial derivative is continuous on U. Hence F o X is C 1 on U. By Theo¬ 
rem 3.4 F o X is differentiable on U. 

dyt 

The partial derivative — is by definition the i, j entry of the matrix 
dtj 

D( FoX)(T). 


According to (3.10) it is also the ij entry of the product 

DF(X(T))DX(T), 


where 


Z)F(X) = 


dy 1 dy\ 


' dx\ dx\ 

dx\ 8x2 . 


dt\ dt2 . 

tty 2 dyi 


dx2 0x2 

dx\ 8x2 

, DX( T) = 

dt\ dt2 . 

dyn dy n 


dx m dx m 

- dx\ . dx m - 


L dh . dt k J 


□ 


We can use the Chain Rule to get another version of the Mean Value Theorem. 
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Theorem 3.9. Mean Value Theorem Let F = (/ 1 ,/ 2 , . ,/m) be a C 1 map 

from R n to R m , on an open set in R n that contains points A, A + H, and the 
line segment 

A + fH, 0 < t < 1 

joining them. Then there exist numbers with 0 < 6i < 1 such that 

F(A + H) - F(A) = MH 

where M is the m by n matrix whose i-th row is Vf( A + 0/H). 


Proof Let 0/(0 - f(A + fH), 0 < t < 1. Then 


0,(0) = fi(A), 0/(1) =//(A+ H). 

By the Mean Value Theorem for single variable functions, there exists a number 6i 
between 0 and 1 so that 


f'fOt) = 0/(1) - 0/(0) = f(A + H) - f(A). 


By the Chain Rule, 


0/(0 = V/(A + tH) • H. 


Therefore for each component of F(A + H) - F(A) we have 


M A + H) - M A) = vy)( A + ft-H) • H. 

This shows that F(A + H) - F(A) = MH. □ 

Second derivatives. We turn now to second partial derivatives. 

Definition 3.8. A function / defined in an open disk in the x, y plane is twice 
continuously differentiable in the disk if its partial derivatives f x and f y exist 
and have continuous partial derivatives. The partial derivatives of f x and f y are 
denoted as f XX ifxy an d f yx ,f yy . They are called the second partial derivatives 
off. 


A twice continuously differentiable function is called a C 2 function. 

The following result is basic: 

Theorem 3.10. For a twice continuously differentiable function f on an open 
disk in the x, y plane, the mixed second partial derivatives are equal: 

/x y = f y x • 
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Proof. Let (< a , b) be a point in the open disk where / is twice continuously differ¬ 
entiable, and consider the following combination of translates of / in the x and y 
directions: Define 

C(h , k ) = f(a + h,b + k)- f(a,b + k) - [f(a + h,b)~ f(a , b )) (3.11) 

and let p and q be the functions 

p(x) = f{x, b + k)- f{x , b), q(y) = f{a + h,y)- f(a,y). 

We can write C{h,k ) in two different ways as 

C(h , &) = p (<2 + h) - p(a ) 

= q(b + k) — q(b). 

Applying the Mean Value Theorem twice for single variable functions to C(h,k ) we 
get that there is a number h\ between 0 and h and a number fci between 0 and k such 
that 

C(h,k) = hp'ia + hi) = h(f x (a + hi,b + k) — f x (a + hi,b)) = hkfxyia + h\,b + k\). 
Similarly there are /12 and ^2 such that 

C(h,k) = kqfb + kf) = k{f y ia + h,b + k 2 )-fyfab + kf)) = hkf yx {a + h 2 ,b + k 2 ). 
This shows that 

fxyia + h , b + * 1 ) = + h 2 ,b + k 2 ). 

Since and are assumed to be continuous functions, the left side tends to 
fxyia, b), and the right to f yx ia, b) as h and k tend to zero. This proves that the mixed 
second partial derivatives are equal. □ 


Problems 


3.18. Find the following derivatives or partial derivatives. 

d x3 

(a) — (/(x)) , where / is a differentiable function from R to R 

(b) fiy+x 2 ) 3 

OX 

d 2 

(c) — giy + x ), where g is a differentiable function from R to R 
ox 

d 3 

(d) — (/(x,y)) , where / is a differentiable function from R to R. 
dy 


3.19. Find the derivatives f xx , fxy , and 
(a) f(x,y) = x 2 -y 2 
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(b) f(x,y) = x 2 + y 2 

(c) f(x,y) = (x + y) 2 

(d) f(x,y ) = e _x cos;y 

(e) f(x,y ) = Q~ ay sin(ax), a constant 

3.20. Express the indicated derivatives of / as a constant multiple of / if this is 
possible. For example 

f(x,y) = ye ax , f xx = a 2 f. 

(a) f(x,y) = e“ T cosv, f xx + 2f yy =? (constant)/ 

(b) f(x,y ) = Q~ ay sin(ax), f xx =? (constant)/ 

(c) f(x, t ) = sin(x - 30, fx ~ 2ft =? (constant)/ 

(d) f(x, t) = cos(,v + c0, fxx ~ af tt =? (constant)/ 



Fig. 3.9 Level sets for Problem 3.21. 


3.21. Some level sets of a differentiable function / from R 2 to R are sketched in 
Figure 3.9. Which of the following derivatives are positive? 

(a) f x ( A) 

(b) f y ( B) 

(c) Du/(P) = U-V/(P) 

(d) Du/(Q) 

3.22. Recall the addition formulas 

cos (u + v) = cos w cos v - sin w sin v, sin(w + v) = sin w cos v + cos u sin v. 

(a) Use the addition formulas and the change of coordinates v = rcosO, y = rsin6to 
express the following functions, given in polar coordinates, in terms of x and y: 

f\ - r 2 cos(2^) fz = r ~ 2 sin(2^) fo = r 3 sin(3^) 

(b) Show that each function in part (a) satisfies the equation f xx + f yy = 0. 
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3.23. Let f(x,y) = 4 + 3v + 2 y + xy and A = (1,1). Find the directional derivatives 
D\f(A) = V • V/(A) in each direction. 

V = (1,0), V = (f,§), V = (0,1), V = (-|4), V = (-1,0). 
Why is one of these derivatives equal to ||V/(A)||? 



Fig. 3.10 A loaf of bread illustrates a mixed derivative fxy in Problem 3.24. 


3.24. Imagine a region in space occupied by a loaf of bread, and let f(a , b) be the 
volume of bread in the region where x<a and y<b, as in Figure 3.10. The following 
arguments illustrate f xy = f yx . 


f(a + h,b)~ f(a, b) 

(a) Note the difference quotient---; the numerator is the volume of 

h 

a certain slice of bread, and the denominator is the thickness. 

(b) Conclude that f x (a , b) is the area of the shaded x-a cross section. 

(c) Explain similarly why f y (a,b ) is the area of the unshaded y-b cross section. 

(d) Note the difference quotient ^ ; the numerator is the area of a 

k 

certain object, and the denominator is its width. 

(e) Conclude that f^a^b) is the length of the vertical segment at the comer where 
x-a, y-b. 

(f) Conclude by a similar argument that f yx (a , b) is also the length of the same seg¬ 
ment. 


3.25. Consider the partial differential equation 


u t + 3 u x = 0 


(3.12) 


for a differentiable function u(x,t). 

(a) Suppose / is a differentiable function of one variable. Show that the function 
u(x,t) = f(x-3t) satisfies (3.12). 
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(Here coordinates in R 2 are (v, t).) Therefore u is constant on lines that are parallel 
to V. 

(c) Show that the line that goes through point (x, t ) and that is parallel to V also goes 
through the point (x - 31,0). 

(d) Show that every solution u(x,t) has the property u(x,t) = u(x-3t,0), i.e., is a 
function of x-31. 

3.26. Let a(x,y ) be a continuously differentiable function of x and y, and define 

b(x) = a(x,x). Show that b is a differentiable function. 

3.27. Consider a function that depends only on the distance to the origin, that is, 



for some function g of one variable. 

(a) Find the function g(r ) in the case f(x,y ) = (x 2 +y 2 ) 3 , and verify the relation 


Xfx+yfy = rg r - 


(3.13) 


(b) Use the Chain Rule to show that relation (3.13) holds for all differentiable func¬ 
tions / that depend only on r as f(x,y ) = g(r). 

(c) Show that for twice differentiable functions of the form f(x,y ) = g(r). 


fxx + fyy ~ Srr + Sr- 
r 


3.28. For functions / of two variables let Af = f xx + f yy . Let a , b , p be numbers. 

(a) Find examples of polynomials f(x,y ) = ax 2 + by 2 for which Af is positive, zero, 
or negative. 

(b) Evaluate Af for the functions f(x,y) = x 4 +y 4 , f(x,y) = (x 2 +y 2 ) p . 

(c) Show that A(\og(x 2 +y 2 )) = 0. What is the domain of log(v 2 +y 2 )? 

3.29. Suppose u(x,y) is a C 2 function from R 2 to R. After a change to polar coordi¬ 
nates u(x,y) = u(r cos 6, r sin 6) we have 




du dx du dy du dx du dy 


dxdr + dy dr "" dxd0 + dydO' 


u s = -3- ^7 + -3- 




3.30. Show that the function u(x,t) = t 1|/2 e 


(t > 0 ) satisfies u t = u xx . 
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3.31. Let u(x,y ) = x 2 -y 2 and v(x,y) = 2xy. 

(a) Show that u x - and u y - -v x . 

(b) Show that u xx + u yy = 0. 

(c) Define w(x,y) = u(u(x,y),v(x,y)). Show that w xx + w yy = 0. 

(d) Suppose p , q , and r are a C 2 functions such that 

p x = q y , p y = -q x , r xx + r yy = 0. 

Define w(x,y) = r(p(x,y),q(x,y)). Show that + w-y-y = 0. 

3.32. Given twice differentiable functions / and g, use the Chain Rule to express 
the following derivatives of 


u(x , t) = /(x + At) + g(x - At) 

in terms of /, g, /", g". 

(a) u x and 

(b) u xx and u tt 

(c) + 

(d) u tt - I6u xx . 

3.33. Let z = i + ij be a complex number, and /(z) = u(x,y) + iv(x,y) a complex 
valued function. 

(a) For /(z) = z 2 , find w(x,y) and v(x,y). 

(b) Define / to be differentiable at z if there is a complex number m so that for all 
complex h near 0 the change 


f{z + h)-f(z) 

is well approximated by mh. Thus 

/(z + h) = f(z) + mh + r(h) 


where r is some function that is small in the sense that ——- tends to zero as the 

h 

complex number h tends to zero. Define f\z) = m. Show that (z 2 )' = 2z. 

(c) Suppose / = u + iv is differentiable. Take h real in part (b) to show that /' is equal 
to the partial derivative u x + iv x . 

(d) Take h imaginary, and show that /' is equal to -iu y + v-y. 

(e) If / = u + iv is differentiable conclude that 

u x ^ 

Uy — V X 

(f) Assume that u and v have continuous second partial derivatives. Deduce that 


3.4 Inverse functions 


Uxx + u yy — 0 ? Vxx + Vyy — 0, 

and verify these for the cases /(z) = z 2 and /(z) = z 3 . 
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3.34. Justify the following steps to prove 

d ryi(t) ryih) 

— g(x,t)dx = g t (x, t)dx + g(y 2 (t), t)y' 2 (t) - g(y\(t), t)y\ (t) 

at Jyi(t) Jyi(t) 

where g(x,t) is continuously differentiable in ( x,t ) and y\(t) and y 2 {t) are continu¬ 
ously differentiable functions of one variable. 


(a) Define f(u,v,w) = I g(x,w)dx. Use the Fundamental Theorem of Calculus to 

Ju 

show that 

fuiu , v, w) = -g(u, w) 9 Mu, V, w) = g(v, w). 

(b) Use differentiation under the integral to show that 


f w (u,v,w) 



-r~(x, w) dx. 
dw 


(c) Show that —f(yi(t),y 2 (t),t) = f u y\(t) +f v y' 2 (t) +f w . 


3.4 Inverse functions 

We recall the following result about differentiable functions of a single variable: 

Let f be a continuously differentiable function of a single variable x on an open 
interval, and suppose that at some point a in the interval, f'(a) is nonzero. Then 
f maps a sufficiently small interval around the point a one to one onto an interval 
around the point f(a). Denote the inverse function off by g , then g is differentiable 

at f(a), and g'(f(a)) = ——. 

f (a) 

In this section we state and prove the analogous theorem for functions from R 2 
to K 2 . That is, if a C 1 function F has an invertible derivative matrix at A then there 
is a disk about A where F has a differentiable inverse function. First we compare 
linear approximations of F at different points near A. 

Suppose F(v,y) = (f(x,y),g(x,y)) = (u,v) is a continuously differentiable function 
on an open set O in R 2 and that A = (a,b) is a point in O. By the definition of 
differentiability / and g can be well approximated near (a,b) by a constant plus 
linear function in the sense 
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u = f{x, y) = f{a, b) + f x (a, b)(x -a) + f y (a, b)(y -b) + n(x-a,y-b) 
v = g(x,y) = g(a, b) + g x (a, b)(x -a) + g y (a, b)(y -b) + r 2 (x -a,y-b ) (3.14) 

where 

\ri(x-a,y-b)\ \r 2 (x - a,y - b)\ 

. and 

a /(x — a) 2 + (y-b) 2 x-a) 2 + ( y-b) 2 

tend to zero as y/(x-a) 2 + (y-b) 2 tends to zero. 

Vector and matrix notation simplifies the description. Denote the vectors 


a 


x 


x-a 

, X = 


II 

X 

1 

> 

II 

b 


y_ 


y-b 


U = 


u 

v 


Denote the functions F(X) = 


g(x,y) 


the linear function 


L A (P) = 


fx(a,b) fy(a,b) 

gx(a,b) g y (a,b) 


P = DF(A)P 


and the remainder function R(P) = 
(3.14) is 


r\ (x — a,y — b) 
r 2 (x - a,y - b) 


. Then the vector notation for 


U = F(X) = F(A + P) = F(A) + L a (P) + R(P). (3.15) 

The next lemma compares the remainders R(P) and R(Q) at different points 
near A. 


Lemma 3.1. Let F = ( f,g ) be a continuously differentiable function defined 
on an open set O in R 2 . Then for every A in O there is a disk N r centered at A 
of radius r > 0 so that 

||R(P)-R(Q)|| < j(P,Q)||P-Q|| (3.16) 

for all A + P and A + Q in N r , and s(P, Q) tends to zero as r tends to zero. 


Proof Let N r be an open ball centered at A that is contained in O. For ||P|| and ||Q|| 
less than r, A + P and A + Q are in O. By (3.15) 

R(P) - F(A + P) - F(A) - L a (P), 

R(Q) = F(A + Q) - F(A) - L a (Q). 

Subtract R(Q) from R(P). By the linearity of La we get 

R(P) - R(Q) = F(A + P) - F(A + Q) - L A (P - Q). 






















3.4 Inverse functions 


135 


Since A + P and A + Q are in N r the segment from A + Q to A + P is in N r . By the 
Mean Value Theorem 3.9 there is a matrix M with rows 


for which 


Therefore 


V/}(.A + Q + 0f(P - Q)), 1 = 1,2, 0 < 6i < 1, 
F(A + P) - F(A + Q) = M(P - Q). 


R(P) - R(Q) = M(P - Q) - L a (P - Q) 

= M(P - Q) - DF(A)(P - Q) = (M - £>F(A))(P - Q). 

Taking the norm of both sides and applying Theorem 2.2 we get 
||R(P)-R(Q)||<||M-DF(A)||||P-Q||. 

This is inequality (3.16) with s(P,Q) = ||M-£>F(A)||. By the continuity of the partial 
derivatives, as r tends to zero, M tends to Z)F(A) so ||M - £>F(A)|| tends to zero. □ 


Theorem 3.11. Inverse function. Let U = F(X) be a continuously differen¬ 
tiable function from R 2 to R 2 defined on an open set containing A. If DF(A) 
is invertible, then F maps in a one to one way a sufficiently small disk around 
A onto a set of points in the U plane that includes all points in some circular 
disk around the point F(A). That is, on a small enough disk centered at A, F 
has an inverse, G, that is differentiable at F(A) and DG(F(A)) = Z)F(A) -1 . 


Proof (i) We show first that the function U = F(X) is one to one for X near A. 
Suppose that F(X) = F(Y), where 

X = A + P, Y = A + Q. 

We use formula (3.15) to express both F(A + P) and F(A + Q): 

F(A + P) = F(A) + L a (P) + R(P), F(A + Q) = F(A) + L A (Q) + R(Q). 

If F(A + P) is equal to F(A + Q), then 

L a (P) + R(P) = L a (Q) + R(Q), 

which implies that 

L a (P - Q) = R(Q) - R(P). (3.17) 

The matrix Z)F(A) representing L A was assumed to be invertible. Multiply both 
sides of (3.17) by Z)F(A) -1 . We get 

P - Q = Z)F(A ) -1 (R(P) - R(Q)). 



136 


3 Differentiation 


Since the two sides are equal, so are their norms: 

IIP-QII = l|OT(A)- 1 (R(P)-R(Q))ll < HZ)F(A)- 1 1 | ||R(P) - R(Q)||. (3.18) 

By Lemma 3.1 we can choose r so small that |s(P,Q)| < \ ||Z)F(A) -1 || 1 . Then by 
(3.16) 

l|R(P)~R(Q)ll < 2 ||Z> F (A ) — 1 1 | l|F ~ QI1 for A + P and A + Q in (3.19) 

where N r is the disk of radius r centered at A. Using (3.19) to estimate the right side 
of (3.18) we get 

IIP-QII < IIOT(A)- 1 1 | 2 || DF | A) -i, H p -QH = ^IIP-QII- (3-20) 

From this it follows that ||P — Q|| = 0; but then P = Q. This proves that the mapping 
U = F(X) is one to one for ||X-A|| < r. Therefore F has an inverse function. Call 
it G. 

(ii) Next we show that the range of F includes all points in some small disk 
around F(A). Let U be such a vector. We shall construct a vector X = A + P that is 
in N r and satisfies 

U = F(X) = F(A) + L a (P) + R(P). (3.21) 

Here r is the number determined in the proof of part (i). We construct P as the limit 
of a sequence of approximations V n defined recursively as follows: 

Po = 0 , 

U = F(A) + DF(A)P n + R(P„_i). (3.22) 

Multiply equation (3.22) by DF(A ) -1 to express P n as 

P„ =£»F(A)- 1 (U-F(A)-R(P„_ 1 )). (3.23) 

By Lemma 3.1, R is continuous. 

If the sequence P n converges to a point P then (3.21) follows from (3.22). 

To prove that the sequence P n converges to a point P with A + P in N r , we first 
show by induction that for all j 


IIP; —P,—(3.24) 
To start the induction we need the n = 1 case of (3.24). We have Po = 0 and 

Pj = Z)F(A) -1 (U-F(A)), 


SO 


||Pi-Poll < ||Z>F(A)- 1 ||||U-F(A)||. 
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We assume U is close enough to F(A) so that 

||£>F(A)- 1 ||||U-F(A)||<^. 
This makes ||Pi|| < 22 and specifies the small disk 


||U-F(A)|| < 


r 

2 2 ||ZDF(A)— 1 1| 


of U such that the sequence V n converges to a point P that satisfies F(A + P) = U. 
As n -th step of the induction we use the inductive assumption that 


IIP;-Py-ill< 


r 


for j = 1,2,..., n to estimate the norm of P ; , j = 1,2,..., n. We write 


P 7 - = Pi+(P 2 -Pi) + - + (P 7 -P 7 -_i). 


We deduce from this and (3.24) by the triangle inequality that 

llPjll < IIPi II + l|P 2 - Pi II + • • • + IIP; -P J - 1 |<^ + ^ + -+ ^ T . (3.25) 

The sum of this geometric series is less than this gives ||P y || < 

By definition (3.23) of F n the difference 

P„ + i - P„ = DF(A)- 1 (U - F(A) - R(P„» - DF(A)- 1 (U - F(A) - «P„-,)) 

= £»F(A)- 1 (R(P„_ 1 )-R(P„)). 

Therefore by (3.19) 

l|P» + l-Pi.il < 2 l|P»-P»-ll|. 

This completes the inductive proof of inequality (3.24) which shows that the 
sequence P j converges to a point P with ||P|| < | < r. Hence we have determined a 
point A + P in N r that is mapped by F to U. So the domain of G contains a small 
disk centered at F(A). 

(iii) To prove that G, the inverse of F, is differentiable at B = F(A), and to prove 
that its derivative is (Z)F(A )) -1 we show that 

l|G(B + K) - G(B) - (£>F(A)) - 1 K|| 


tends to zero as ||K|| tends to zero. Let 

H = G(B + K) - G(B). 
Since G(B) = A, we can rewrite this as 
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A + H = G(B + K). 

Applying F to both sides we get F(A + H) = B + K. Since B = F(A) it follows that 

K = F(A + H) - F(A). 

Using these relations and factoring out Z)F(A) -1 we see that (3.26) is equal to 

||H — (Z>F(A)) _1 K|| = ||(Z)F(A)) _1 (DF(A)H - K)|| 

IIK|| l|K|| 

_ ||(£»F(A))- 1 (Z)F(A)H-(F(A + H)-F(A)))|| 

“ iiKii ' 

Using the matrix norm (Theorem 2.2) we get that 


||G(B + K) - G(B) - (Z)F(A))- 1 K|| IIWA))' 1 ||||(z>F(A)H - (F(A + H) - F(A)))|| 


l|K|| 

We next show that 


l|K|| > 


nun 


l|G(B + K) - G(B)|| 


(3.27) 


(3.28) 


2||(Z)F(A))- 1 || 2||(Z>F(A)) —1 1| 

To prove this, multiply K = F(A + H) - F(A) = /_)F(A)H + 7?(H) by Z)F(A) -1 to get 


DF(A) _1 K = H + £>F(A) _1 /?(H). 


Theorem 2.2 and the triangle inequality then give 

HZ)F(A)- 1 1| ||K|| > ||£»F(A)- 1 K|| 

= ||H + Z)F(A) _1 /?(H)|| > ||H|| - ||£)F(A) _1 /?(H)||. 

In inequality (3.19) set P = H and Q = 0; we get 


ll*(H)|| < 


IIH|| 


(3.29) 


2||Z>F(A) —1 1| 

which implies 

||DF(A)- 1 7?(H)|| < ||£>F(A)- 1 ||||/?(H)|| < ±||H||. 

Replacing ||Z)F(A) _1 R(H)|| in (3.29) by the larger ^||H|| we get a new inequality 
||OT(A)- 1 ||||K|| > ||H||- |||H|| = 1||H||. 


Dividing by ||DF(A) : || completes the proof of (3.28). 
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Inequality (3.28) implies that G is continuous at B. If we replace ||K|| in the 
denominator of (3.26) by the smaller and use inequality (3.27) we 


get 


0 < 


2||(DF(A))- 
||G(B + K)-G(B)-(OT(A))- 1 K|| 


l|K|| 


< 2||(Z)F(A)) 


■i 2 ||F(A + H)-F(A)-DF(A)H|| 
IIHII 


(3.30) 


It follows from the continuity of G at B that as ||K|| tends to zero, ||H|| tends to zero. 
Since F is differentiable at A, 

||F(A + H) - F(A) - DF(A)H|| 

m 

tends to zero as ||H|| tends to zero. Using inequality (3.30) we get 
||G(B + K) - G(B) - (Z)F(A)) -1 K|| 


tends to zero. This shows that G is differentiable at B, and its matrix derivative at B 
is (Z)F(A)) -1 . That is, 

Z)G(F(A)) = (Z)F(A)) -1 . 

This concludes the proof! □ 


Example 3.18. The function F(x,y) = (2 - y + x 2 y, 3x + 2y + xy) maps R 2 to R 2 . 
It is continuously differentiable. 


F(l, 1) = (2,6), 


DF(x,y) = 


2xy 

3+y 


-1 +x 2 
2 + x 


DF(1,1) 


2 0 
4 3 


Since detZ)F(l, 1) ^ 0, by Theorem 3.11 F has an inverse defined in an open 
set about (2,6). The inverse is differentiable and 

£>F -1 (2,6) = (£>F(1,1)) -1 . 


Since the inverse of a 2 by 2 matrix is given by 
derivative of F -1 at (2,6) is 


a b 

i 

d -b 

c d 

ad-be 

-c a 


DF _1 (2,6) 


i 

30 


2 01 

6 

-4 2 


2 1 

L 3 3-1 


To estimate F 1 (2.1,5.8) we can use the linear approximation 
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F _1 (2 + .1,6 + (-.2)) * F _1 (2,6) + £>F _1 


.1 

-.2 
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( 2 , 6 ) 


1 


\ 0 

.1 


1.05 

1 

+ 

2 1 

L 3 3 J 

-.2 


. 86 ... 


□ 


Example 3.19. Given the system of equations 


2-y + x 2 y = 2 
3x + 2 y + xy = 6 

we see that (x, y) = (1,1) is one solution. Let F(x, y) = (2 - y + x 2 y , 3x + 2y + xy). 
We’ve seen in Example 3.18 that F has an inverse defined in an open set about 
(2,6). That means for each ( m , v ) in a disk centered at (2,6) of some small 
enough radius, the system of equations 

2 - y + x 2 y = u 
3x + 2 y + xy = v 


has a unique solution (x,y) and it is close to ( 1 , 1 ). □ 

The statement and the proof of the Inverse Function Theorem make no use the of 
the fact that the number variables is two. Therefore the ^-dimensional analogue of 
Theorem 3.11 holds for differentiable functions of any number of variables. 


Theorem3.12. Inverse function. Let F = (/i,/ 2 ,...,/ n ) be a continuously 
differentiable function on an open set O in ML. Assume DF(A) is invertible at 
a point A of O. Then F maps a sufficiently small open ball centered at A one 
to one onto a set that includes an open ball centered at F(A). F has an inverse 
G on this set. G is differentiable at F(A) and DG(F(A)) = (DF(A)) -1 . 


Implicitly defined functions. A consequence of the Inverse Function Theorem is 
the following theorem for functions from R 3 to R. 

Theorem 3.13. Implicit Function. Let f be a continuously differentiable 
function from a ball centered at a point P in x,y,z space R 3 , to R. Suppose 
f z ( P) + 0. Then all points X sufficiently close to P that satisfy /(X) = /(P) are 
of the form X = (x,y,g(x,y)) where g is a continuously differentiable function. 

The partial derivatives of g are related to the partial derivatives of f by 
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Proof. Define the vector function F (x,y,z) = ( x,y,f(x,y,z )). The matrix derivative 
of F is 


DF(*,y,z) 


1 0 0 
0 1 0 


fx fy fz 


Its determinant is f z (x,y,z). Since this is not zero at P = (puP2,P3), the Inverse 
Function Theorem gives us that F is invertible near P, with differentiable inverse. 
Let /(P) = c; then F(P) = (pup 2 ,c). F maps a small ball centered at P one to one 
onto a set in R 3 that includes a ball centered at F(P). See Figure 3.11. The inverse 
of F is of the form 

F -1 (x,y,w) = (x,y,h(x,y,w)) 

where h is a differentiable function. Define g(x,y) = h(x,y , c). Then g is differentiable 
and the relations 


F \x,y,c) = (x,y,g(x,y)), (x,y,c) = F(x,y,g(x,y)) = (x,y,f(x,y,g(x,y))). 


give 


f(x,y,g(x,y)) = f(P) = c. 


Differentiating both sides with respect to x by the Chain Rule gives 


(3.31) 


dfd(x) + dfd(y)^dfdg _ d(c) 
dx dx dy dx dz dx dx 


or 


—+ 0 +—— 

dx dz dx 


m - 0 . 


Since f\ P) ^ 0 and f z is continuous, it follows that f z is not zero for points suffi- 

fx 

ciently near P. Then dividing by f z we get g x = - — . Similarly differentiating both 

Jz 

sides of (3.31) with respect to y by the Chain Rule gives 


d£d(x^^d£d(y)^d£dg^ _ d(p) 
dx dy dy dy dz dy dy 


so 


^+ 0 +^ 

dy dz dy 


: 0 . 


Therefore g y = - 


fy_ 

fz 


□ 


Example 3.20. Consider the point (1,2,1) on the level surface 


xyz + z 3 = 3 

of the function f(x,y,z ) = xyz + z 3 . We compute f z (x,y,z) = xy + 3z 2 and 
f z ( 1,2,1) = 5 ^ 0. By the Implicit Function Theorem we can solve for z as 
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Fig. 3.11 A ball centered at P is mapped to a set that includes a ball centered at F(P). 


a function of v and y near (1,2). That is, there is a differentiable function 
g defined in an open disk centered at (1,2), with g(l,2) = 1, whose graph 
(v,y,g(.x,y)) is in the 3-level set of /, 

f(x,y,g(x,y)) = xyg(x,y) + g(x,y ) 3 = 3. 


The partial derivatives of g at (1,2) are 


gx(l>2) = - 


Ad,2,1) 

fz( 1,2,1) 


y z 


xy + 3 z 2 


(1,2,1) 


2 

5’ 


ft>(l,2) = - 


vz 


fy{ 1,2,1) = 

£(1,2,1) xy + 3z 2 


(1,2,1) 


1 

"5‘ 


The equation of the plane tangent to the graph of g at (1,2,1) is 


(- 


l,y-2,z- 1) = 0, 


or (2,1,5) • (x -1 ,y - 2, z -1) = 0, the same equation we found in Example 3.16. 
□ 

The Implicit Function Theorem says that a level surface S , f(x,y,z ) = k, of a C 1 
function / is locally the graph of a differentiable function, say z = g(x,y). If ( a,b,c ) 
is on S and f z (a, b,c)± 0 then 


(gx(a,b),g y (a,b),-l)-(x-a,y-b,z-c) = 0 (3.32) 


is the equation of the tangent plane at (a,b,c). Since 


gx(a,b) = - 


f z (a,b,c) 


and 


gy(a,b) = - 


f y (a,b,c) 

MaJycY 


equation (3.32) is the same as 
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(fx(a, b, c), f y (a, b, c), f z (a,b,c))-(x-a,y-b,z-c) = 0 


or 

Vf(a,b,c) • (x — a,y — b,z — c) = 0 


which is equation (3.9) in Section 3.3. 


Example 3.21. The pressure P, temperature T, and density p of a gas are 
related by 


P + ap 2 


RpT 
l- bp 


where a , b , and R are nonzero constants. Find an expression for the rate at 
which the density changes with respect to a change in temperature. Rather 
than trying to solve for p explicitly in terms of T and P we use the Implicit 
Function Theorem. Define 


f{P,T,p) = P + ap 2 - 


RpT 
l- bp' 


Then 


, RT bRpT 

■ tp ~ p i-b P (i -b P ? 



RT \ 

(i -bp) 2 J p 


is nonzero at a given P and T provided that 2 a - 


RT 


± 0. In that case 


(1 -bp) 1 

there is a function g defined near ( P\ T ), p = g(P, T). The partial derivative 


fj 

gT (PJ) = - J -± = 
Jp 


mth 

1 -bg{P,T) 


( 2 «-rafe 


R 


2a{\-bg{PJ))- 


RT 

(1 -bg(P,T)) 


Remark: The physics notation for gr in Example 3.21 is 

what variable was held constant. 

Next we state a more general version of Theorem 3.13. 



□ 

as a reminder of 
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Theorem 3.14. Implicit Function. Let 

F(X, Y) = 

be a C 1 function from R^ +m to R m on an open set that contains (A,B), and 
consider the set S in R n+m of points (X, Y) that satisfy the system of equations 

fi(xi,...,x n ,yi,...,y m ) = ci 
fl(Xi ,..., x n ,y \,... ,ym) = C2 

finite i,.. •, x n ,y \,... ,y m ) = c m . 

Suppose (A, B) = (a \,..., a n , b \,..., b m ) is in S and that the determinant of the 
matrix of partial derivatives 


Jp(A,B) 

Ydyj 

is not zero. Then there is a C 1 function Gfrom R" to R m such that every point 
(X, Y) in S sufficiently close to (A,B) can be expressed as (X, Y) = (X,G(X)), 
that is, 


y\ =8\( xi ,..., x n ) 
yi = g 2 (x u ...,x n ) 

ym — 8mi.2C \,..., X n f 


The partial derivatives of the f and gj are related by 


[dfii 


\dh . 

.. Wl] 


[dg\ i 


0 

dx j 

+ 

dyi 

dy m 


djcj 

— 


dfm 


dfm 

.. 


dgm 


0 

- ^ X i - 


-dy\ 

dy m - 


~ & x j - 



(3.33) 


Proof We will prove just the last part of this theorem. Assume such C 1 functions 
gi exist. Apply the Chain Rule to each function f, 


8 

dxj 


(fi(x 1,.. ., X n ,g lOi,... , X n ),.. ., g m (x 1,... ,X„))) 


d 

dxj 


Vi) = o. 


That gives 
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or 


dfi ^ (>fi dgi f dfi %2 | | dfi dgm 

dxj dy i dxj dyi dxj dy m dxj 


d fj . r m M 


Ml 

dym 


dg 1 
dx/ 
dg2 

dxj 


= 0 


dgm 

dXj 


as we see in the matrix equation (3.33). 


Example 3.22. The equations 


□ 


/i(xi,x 2 ,x 3 ,;yi,;y2) = 2x^1 +x 2 ^2 = 4 
/ 2 (xi,x 2 ,x 3 ,;yi,;y 2 ) = + *2^2= 3 


are satisfied by 

(xi,x 2 ,X3,yi,y 2 ) = ( 1 , 1 ,- 1 , 1 , 2 ). 

Find and at (1,1,-1,1,2). 

ox 2 <7X2 

To see whether y\ andy 2 are functions of (xi,x 2 ,x 3 ) near (1,1,-1,1,2) we 
check the determinant there: 


det 


r dp dp 1 

r 


= det 

- dyi dy 2 - 

- 


2x\ X 2 
4 xjxjyl 2xyyi 


- det 


2 1 
-4 4 


= 12 . 


That is not zero so the Implicit Function Theorem applies. Equation (3.33) 
gives at (1,1,-1,1,2), 


d_h 

d *2 

dfi 


dx 2 


+ 


2 1 
-4 4 


" dn ' 

dx 2 
dyi 

- dx 2 _ 



2 1 
-4 4 


_ dn ' 
dx 2 
dyi 

- dx 2 _ 


0 

0 


Multiply by the inverse matrix to get 


" dyi ' 
dx 2 
dyi 
- dx 2 _ 


1 

F2 


4 -1 
4 2 



□ 


Problems 

3.35. Let F(x,y) = (e x cosy,e* sin y). 

(a) Find the derivative matrix DF. That is, writing u = e x cosy, v = e x siny find 
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du du 
dx dy 
dv dv • 

. dx dy . 

(b) Show that Z)F(A) -1 exists at every point A = (a,b) so F is locally invertible at A. 

(c) Show that F is not invertible globally; that is, find different points of the domain 
that are mapped to the same value in the range. 

3.36. Let F(v,y) = (vcosy, vsiny) for v > 0. 

(a) Find the derivative matrix DF(A) at point A = (a,b). 

(b) Show that DF(A) -1 exists at every point with a > 0 so F is locally invertible 
at A. 

3.37. Let F(v,y) = (x 4 + 2xy + l,y), and consider the equations 

x 4 + 2xy + 1 = u 
y = v 


that is, F(v,y) = (u, v). 

(a) Show that for (w,v) = (0,1), one solution is (v,y) = (-1,1). 

(b) Find the derivative matrix £>F(x,y). 

(c) Show that DF(-1,1) is invertible, and hence F _1 exists on a small enough disk cen¬ 
tered at (0,1). 

(d) Use the linear approximation to estimate F _1 (.2,1.01). 

3.38. Let F(v,y) = (v + y -1 , x~ l +y), and consider the system of equations 

jt + y -1 = 1.5 
x~ l +y = 3 


(a) Show that (v,y) = (1,2) is a solution. 

(b) Show that the derivative matrix DF(1,2) is invertible. 

(c) Find a linear approximation for F _1 ( 1.49,2.9). 

3.39. Suppose / is a C 1 function from M 3 to R, with 

/(1,0,3) = /(1,2,3) = /(l,2,-3) = 5 


and 


V/(l,0,3) = (0,1,0), V/(l,2,3) = (4,1,-j), V/(l,2,-3) = (0,0,0). 

(a) Can we solve the equation /(v,y,z) = 5 for z as a function of (v,y) near (1,2, -3)? 
That means: does the Implicit Function Theorem guarantee existence of a func¬ 
tion g from a disk around (1,2) into R so that g(l,2) = -3 and f(x,y,g(x,y )) = 5? 

(b) Can we solve /(x,y,z) = 5 for y in terms of (x,z) near (1,0,3)? 

(c) Can we solve f(x,y,z) = 5 for y in terms of (x,z) near (1,2,3)? 
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(d) Can a function y = g(x,z ) that satisfies part (b) also satisfy part (c)? 

(e) Can we solve f(x,y,z) = 5 for v in terms of (y,z) near (1,0,3)? 

3.40. Let f(x,y,z) = z 4 + 2yz + x, and consider the level set 

z 4 + 2yz + v = 0. 

(a) Show that (0,2,0) is on the level set. 

(b) Find the partial derivatives of /. 

(c) Show that there is a function z = g(x,y) defined near (0,2) so that 

f(x,y,g(x,y)) = 0. 

(d) Find the partial derivatives g x (x,y ) and g y (x,y), and use them to find the second 
derivatives g xx (0, 2), ^(0,2), g w (0,2). 

3.41. Let 


fi (x,yi,y 2 ) = 3yi+yj + 4x = 0 
fl(x,y\,yi) = 4y\ +y 2 + x = 0 

(a) Verify that (x,yi,y 2 ) = (-4,0,4) is a solution. 

(b) Show that there is a function G = (gi,g 2 ) such that gi(x) = y 1 , g 2 (x) = y 2 for all 
solutions near (-4,0,4). 

(c) Show that at v = -4, 

d£ i = 4 dg2 = | 

dx 3 ’ dx 

3.42. Let F (u, v, w) = (u + v + w, uv + vw + wu), and consider the system of equations 
F(k,v,w) = (2,-4): 


u + v + xv = 2 
wv + vw + wu = -4. 


(a) Verify that (2,2, -2) is a solution. 

(b) Writing F(w,v,w) = (/i(w,v,w),/ 2 (w,v,w)), find all the first partial derivatives of 
fi and f 2 . 


(c) Verify that DF(2,2, -2) = 


1 1 1 
004 


(d) Can the equations be solved for v and w in terms of u near (2,2,-2)? That 
means: does the Implicit Function Theorem guarantee existence of a func¬ 
tion G = (g\,g 2 ) defined on some interval around 2 so that G(2) = (2,-2) and 
F(w,gi(w),g 2 (w)) = (2,-4)? 

(e) Can the equations be solved for u and v in terms of w near (2,2, -2)? 


3.43. Let T be the set of all points (x,y, w,v) in R 4 that satisfy 
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Fig. 3.12 A tangent vector at a point of the unit circle in Problem 3.43. 


x 2 +y 2 = l, 


ux + vy = 0. 


T is the set of all vectors tangent to the unit circle; see Figure 3.12. 

(a) We relabel coordinates to conform to the Implicit Function Theorem: Define 
F(xi,X 2 ,yi,y 2 ) = (x 2 +y 2 ,x 2 x 1 +y 2 yi) where x\ = x, y\ = y, X 2 = u and y 2 = v. 
Then T is the set defined by 


¥(x 1 ,x 2 ,yi,y 2 ) = (1,0). 


Calculate the matrix [ 


(b) Use the Implicit Function Theorem to deduce that near any point where y A 0, 
the equations x 2 + y 2 = 1, ux + vy = 0 can be solved for y and v in terms of x 
and u. 

(c) Find formulas for y and v in terms of x and u valid when y A 0. 

(d) Find formulas for x and u in terms of y and v valid when x A 0. 


3.5 Divergence and curl 

In this section we introduce two combinations of partial derivatives for functions 
from R 2 to R 2 or from R 3 to R 3 . Their significance for integration will be discussed 
in Chapter 8. Given 

G(x,y) = (gi(x 9 y),g 2 (x,y)) and F(x,y,z) = (/i(x,y,z),/ 2 (x,y,z),/ 3 (x,y,z)), 
the derivative matrices of G at (x,y) and of F at (x,y,z) are 



djl dfl d _h I 

dx dy dz 


fix fly flz 
fix fly flz 
fix fly flz 


dfi < 9/2 dfl 
dx dy dz 
df 3 dfi dfi 
dx dy dz 


Two special combinations of the partial derivatives of G and of F are the divergence , 
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dlvG= fi + fa, 

dx dy 


^ dfi df 2 d/s 

divF = A + A + A, 

dx dy dz 


and the curl , 
curlG = 


dg2 dgi 
<9* ^ ’ 


curlF: 


d/3 d/2 

dy dz ’ 


d/ 3 _d/(\ d _h 
dx dz / dx 


d A 

dy 


We use the notation V = and the formulas for computing dot product, 

cross product, and multiplication by a scalar to express the formulas for divF, curlF, 
and the gradient of /. For a differentiable vector field F = (fi,f 2 , f 3 ), 


divF = V-F = 


d d d 

7-/1 + 7-/2 + ^“/ 3 , 
ax oy <9z 


and 





i J 

k 




= V 

xF = 

det 

d_ d_ 

dx dy 

d_ 

dz 







Ifi h 

/d 




<9/3 

_d/2 


,d.h 

.A \ i+ 1 

r<9/ 2 

.A) k 
' k ' 


dz 


' dx 

«9z 9 J 1 

^ dx 



-( 


For a differentiable scalar valued function / from R 3 to I 

^f-wm 


Example 3.23. Let F (x,y,z) = ( x 2 ,xy,zy). Then 

divF = 2x + x + y = 3x + y 


and 

curlF = (z- 0, -(0 - 0),y - 0) = (z, 0,y). 


□ 

Example 3.24. Let F(x,y,z) = C*Ly> z ) f or ^ (0,0,0). Then 

(v 2 +y 2 + z 2 ) 3/2 

d / x \ (v 2 +y 2 +z 2 ) 3/2 -v|(v 2 +y 2 +z 2 ) 1/2 (2v) 

dx \ (x 2 + y 2 + z 2 ) 3 / 2 / ((x 2 + y 2 + z 2 ) 3 / 2 ) 2 

Canceling a factor of (v 2 +y 2 + z 2 ) 1 ^ 2 we get 

v 2 +y 2 + z 2 -3v 2 
lx (x 2 +y 2 +z 2 ) 5 / 2 


Similarly for f 2y and / 3 Z we get 
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_ x 2 +y 2 +z 2 - 3y 2 
[y ~ (x 2 +y 2 + z 2 ) 5 1 2 ’ 
x 2 +y 2 +z 2 - 3z 2 
(x 2 +y 2 + z 2 ) 5 / 2 


f2y = 
foz = 


Therefore divF (x,y,z) = 


3(x 2 +y 2 +z 2 ) - 3x 2 - 3y 2 - 3z 2 


= 0 . 


(x 2 +y 2 + z 2 ) 5 / 2 

Example 3.25. Let F be the inverse square vector field in Example 3.24, and 
let 

H(X) = -F(X). 

Then 

divH(X) = div(-F(X)) = -divF(X) = 0. 


Example 3.26. Let G(v,y) = (—y,x). Then 


and 


divGUv) = jr^-y) + 2-(x) = 0 
ox oy 


curlG(x,y) = 2-(x)-2-(-y) = 1 -(-1) = 2. 
ox oy 


Example 3.27. Let G(x,^) = I - - , - — - ), ( x,y) ± (0,0). Then 

\x z +,y z x z +y zf 


divG = —- 


-y 


d 
+ — 


dx\x 2 +y 2 ) dy\x 2 +y 2 ) (x 2 +y 2 ) 2 (. x 2 +y 2 ) 2 


-(-y)2x + ~(x)2 y 


= 0 , 


and 


curlG(v,^) = 


8 

dx 


x \ d 
x 2 +y 2 ) dy 


-zl2\ 

x 2 +y 2 ) 


(x 2 +y 2 ) l-x(2x) 
(x 1 + V 2 ) 2 


(x 2 +y 2 )(-l)-(-y)2y 
(x 2 +y 2 ) 2 


_ 2(x 2 +y 2 )- 2x 2 - 2y 2 
~ (x 2 +j 2 ) 2 ~ 0 ' 

□ 

Next we investigate what the divergence and curl tell us about the vector field 
itself. 

Divergence. Suppose that F (x,y 9 z) = (f\(x,y,z),f 2 (x,y,z),fy(x,y,z)) represents the 
velocity of a fluid at point (x,y, z) at some moment in time. Imagine a small box with 
one vertex at the point P = (xo,yo,zo) and the edge lengths Ax , Ay, Az, as illustrated 
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in Figure 3.13. Let N be the outward unit normal vector at points on the surface of 
the box. 



(x 0 + Ax, y 0 , z 0 ) 


Fig. 3.13 Fluid flows through a box. 

We estimate the rate of flow out of this box across its faces. Denote the midpoint 
of the box as (x m ,y m ,z m ). For a C 1 vector field F on a small box, at points of the 
bottom face F is approximately F(x m ,y m ,zo), and on the top face is approximately 
F(v m ,y m ,zo + Az). The combined rate at which fluid flows out of the box through 
these two faces is approximately 

FO m ,y m ,z 0 + Az) • (0,0, l)AxAy + F(x m ,y m ,z 0 ) • (0,0, -1 )AxAy 

= (f 3 (x m ,y m ,z 0 +Az) - f 3 (x m ,y m ,z 0 ))AxAy. 

See Figure 3.14. Using the Mean Value Theorem of calculus this is equal to 



dz 


where z lies between zq and zq +Az. 


( 0 , 0 , 1 ) 


x 



Fig. 3.14 Approximate flow across the top and bottom of the box. 
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By similar arguments the approximate outward flow rate of F across the faces of 
the box parallel to the x, z plane is 

dfi _ 

— (x m ,y,z m )AyAxAz 
dy 


for some y between yo and yo +Ay, and across the faces of the box parallel to the y, z 
plane is 


df\ _ 

—(x,y m ,z m )AxAyAz 

OX 


for some v between vo and vo + Ax. 

The total approximate outward flow rate across the surface of the box is 


df\ _ df 2 _ <9/3 

—(x,y m ,z m )+— (x m , y, z m ) + — (x m , y m , z) 
ox dy dz 


| AxAyAz. 


We define the average flux density to be the net outward flow rate across the surface 
divided by the volume of the box. Then the average flux density of F across the 
surface of the box is 


df\ _ < 9/2 _ < 9/3 _ 

( x,y m ^z m ) H — (x m ,y,z m ) H — (v m ,y m ,z). 
ox oy oz 


Since the partial derivatives are continuous, as Ax, Ay, and Az tend to zero, T and x m 
tend to vo, y and y m tend to yo, and z and z m tend to zo, and the average flux density 
tends to 


df\ df2 df'x 

—(v 0 ,yo,z 0 )+ —(v 0 ,y 0 ,^o)+ -z-(xo,yo,z 0 ) = divF(P). 

ox oy oz 


divF(P) is called the flux density of F at P. 



Fig. 3.15 Outward flux is positive, in Example 3.28. 


Example 3.28. Let F(v,y,z) = (v,0,0). Then 


divF = 


dx <90 (90 
dx dy dz 
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See Figure 3.15. □ 

Curl in R 2 . Imagine that G(v,y) is the velocity of a fluid in the x,y plane at some 
moment in time, and suppose G is continuously differentiable. Consider the tangen¬ 
tial component of G as we traverse the boundary of a small rectangle as shown in 
Figure 3.16. 


y 


° / 

/ . 

{p^mi Vm) 


(xq + Ax, 2 /q + Ay) 



Oo, 2/o) 


X 


Fig. 3.16 We consider the tangential component of G at the boundary. 


Because G is continuous and the rectangle is small, G(v,y) for points along 
the lower edge is approximately G(v m ,yo)> along the right edge approximately 
G(vo + Ax,y m ), along the top edge approximately G(x m ,yo + Ay), and along the left 
G(vo,y m ). The product of the tangential component of G on an edge times the length 
of the edge is the circulation of G along that edge. The circulation of G around the 
rectangle is 

G(v m ,y 0 ) • (l,0)z/v + G(vo + A\x,y m ) • (0, \)Ay 

+ G(v m ,y 0 +4)0 • (~l,0)Ax + G (xo,y m ) • (0,-1 )Ay. 

Applying the dot products the circulation of G around the rectangle can be rewritten 
= gi (x m ,y 0 )Ax + g2(*o +Ax,y m )jy - g x (x m ,y 0 +Ay)Ax - g 2 (x 0 , y m )Ay. 

By the Mean Value Theorem this equals 

(- g\ y (x m ,y)Ay)Ax + (g 2x (x,y m )Ax)Ay. 

As Ax and Ay tend to zero, x and x m tend to xq, and y and y m tend to yo- Since the 
partial derivatives are continuous, the circulation densities 

(~ gi y (x m ,y)Ay)Ax + (g 2 x(x,y m )dx)Ay 
AxAy 


tend to 


-ghyCVb^o) + g 2 x(xo,yo) = curlG(v 0 ,yo). 
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Example 3.29. Let G(v,y) = (-y 9 x) be a velocity field. A brief calculation 
shows that curl G = 2 at every point. We observe that 

G(h,0) = (0,h) 

G(0,h) = (-h,0) 

G(-/*,0) = (0 ,-h) 

G(0,-A) = (/i,0). 


If you sketch this vector field with h > 0 small, it suggests that near the origin 
G is the velocity of a counterclockwise rotation. □ 

Example 3.30. Let G(A,y) = (y,0). Then 

curlG(x,y) = Ao)- Av) = -1. 
ox oy 

As Figure 3.17 shows, small objects floating in this field rotate clockwise. □ 



Fig. 3.17 The field in Example 3.30. 


The next example shows that the overall rotation of a fluid may differ from the 
local rotation or curl. 

Example 3.31. Let F be the vector field F(v,y) = ^ , where p > 0. A 

(. x l + y z ) p 

sketch of F is illustrated in Figure 3.18. The field appears to rotate counter- 

2-2 p 

clockwise about the origin. The curlF = — --— is negative, zero, or positive 

(x z -\-y z ) p 

when the exponent p is greater than 1, equal to 1, or less than 1 as we ask you 
to verify in Problem 3.47. □ 

Curl in R 3 . Take a small box with one vertex at (voAo^o) an d edge lengths 
Ax,Ay,Az. Denote the center point (x m ,y m ,z m ). At the center of each face form the 
product (N x F)(Area of the face), sum over all faces and divide by the volume of 
the box. We get 

((-1,0,0) x F(x 0 ,y m , z m )AyAz + (1,0,0) x F(x 0 +Ax, y m , z m )AyAz 


+ (0, -1,0) x F(x m ,y 0 ,z m )AxAz + (0,1,0) x F(x m ,y 0 +Ay, z m )AxAz 

+ (0,0, -1) X F(x m , y m , z 0 )AxAy + (0,0,1) X F (x m ,y m ,z 0 +Az)AxAy) —2 


AxAyAz 
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Fig. 3.18 A sketch typical of the vector fields F in Example 3.31. The curl of F may be positive or 
negative. 


In Problem 3.52 we ask you to show that this tends to curlF((vo,yo>^o) as Ax, Ay, 
and Az tend to zero. 

Gradient. In Problem 3.53 we ask you to explore how the formula for the gradient 
of the pressure / at a point in a fluid arises by looking at the pressure forces on the 
surface of a small box. The pressure force on each face is approximately the product 
(- /N(Area of the face)) where / is the pressure at the midpoint of the face. We ask 
you to show that the approximate total pressure force on the surface of a small box, 


- f(x 0 ,y m ,z m )(-l,0,0)AyAz - f(x 0 +Ax,y m ,z m )(l,0,0)AyAz, 

~ f(x m ,y 0 ,z m )(0, -1,0)AxAz - f(x m ,y 0 + Ay, z m )(0,1,0)AxAz 

-f(x m ,y m ,z 0 )(0,0,-l)AxAy-f(x m ,y m ,zo +Az)(0,0, l)AxAy 


is equal to 

— C/x(-T>Tm? z m),fy(Xm>yi z m)-> Z^AxAyAz 

for some T between xo and xo +Ax, y between yo and yo +Ay, and z between zo and 
zo +Az. Dividing by the volume we see that for continuous partial derivatives the 
force per volume 


C ymi z m)-> fyi^nuy^ -^m)?/z(-^m?3 ; m? -^)) 


tends to 


-Vf(xo,yo,z 0 ) 


as Ax, Ay, and Az tend to zero. Thus the gradient of the pressure at a point can be 
interpreted as a vector that arises from pressure differences on opposite sides of a 
small box and represents the total force per volume at that location. 
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Laplace operator. Another important operator defined for scalar functions is the 
composite of divergence and gradient 


div grad/ = 


dx 2 dy 2 dz 2 


It is denoted as Af and called the Laplace operator. 

The gradient and the Laplace operators are linear: that is for all once, respectively 
twice, differentiable functions / and g and all numbers a and b , 


V(af + bg) = aVf + bVg 
A(af + bg) = aAf + bAg 


We define the Laplace operator A for a twice differentiable vector valued function 
componentwise. It is a linear operator, and so are the operators curl and div; that is, 
for all twice differentiable functions F and G and numbers a and b , 


A{aF + bG) = aA¥ + bAG 
curl (aF + bG ) = acurlF + Z?curl G 
div (aF + bG) = adivF + MivG 

Vector differential identities. Let / and g be scalar valued functions from R 3 to R 
and F and G be vector fields from R 3 to R 3 . Assuming that all required first, second, 
and sometimes third, partial derivatives exist and are continuous, we have lots of 
identities. 

We ask you in Problems 3.56-3.59 to verify the following identities. In (e) and 

(f) we define 

F-VG = /iGjc + f 2 G y + h G z = (DG)F. 

(a) V(/g) = /Vg + gV/ 

(b) div (/F) = /divF + F • V/ 

(c) div (F x G) = (curl F) G - F curl G 

(d) curl (/F) = /curlF + (V/) x F 

(e) curl(F x G) = (divG)F + G VF - ((divF)G + F • VG) 

(f) V(F • G) = F x curlG + G x curlF + F • VG + G • VF 

(g) curl curlF = V(divF)-zlF where AF = (Af\,Af 2 ,Af 2 ) 

(h) A(fg) = fAg + 2Vf-Vg + gAf. 

(i) div(V/ x Vg) = 0 
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In the next two examples we prove the identities 

• divcurlF = 0 

• curlV/ = 0 


Example 3.32. Assume / has continuous second partial derivatives. We verify 
that curl (V/) = 0. 

curl (V/) = curl (f x ,f y ,fz) 



Since the mixed second partial derivatives of a twice continuously differen¬ 
tiable function are equal, this is (0,0,0). □ 

Example 3.33. Let F = (/i, / 2 , f?,) be a twice continuously differentiable vector 
function. We show that divcurlF = 0. 


curlF = V xF 


-j-h-j-h,- 
dy dz 


ox oz I ox oy 


So 


divcurlF = V • (V x F) 


d Id . d \ did , d \ did , d A 

= 7T ~ ~z~ J2 + -z-J l - Z-J3 + z-\ Z-J2 ~ z-J l 

dx \dy dz ) dy\ dz dx ) dz\ dx dy ) 

d d r d d r d d r d d r d d r d d r „ 

- h - fi 3- fi -ft h-A- f\ — 0. 

dx dy dx dz dy dz dy dx dz dx dz dy 

□ 


Problems 


3.44. Compute the indicated curls and divergences. 

(a) curl (v, 0,0), div(v,0,0) 

(b) curl(0,v,0), div(0,x,0) 

(c) curl(0,0,v), div(0,0,v) 

3.45. Let F(x,y,z) and G(x,y,z) be differentiable vector fields that satisfy 

curlF (x,y,z) = (5y + 7z,3*,0), curlG(l,2,-3) = (5,7,9), 

divF(l,2,-3) = 6, divG(v,y,z) = x 2 -zy. 

(a) Find div(3F + 4G) at the point (1,2,-3). 

(b) Show that divcurlF(v,y,z) = 0. 

(c) Find curl(3F + 4G) at the point (1,2,-3). 
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3.46. Let F(x,y,z) = (xy 2 ,yz 2 ,zx 2 ). Find curlF(l,2,3) and divF(l,2,3). 


3.47. Consider the vector field H(x,y) = 


(-y,x) 


(x 2 +y 2 ) p 

the vectors H(1,0), H(0,1), H(-1,0), and H(0,-1). Show that 


for points other than (0,0). Draw 


curlH = 


( 2-2 p) 

(x 2 + y 2 )P 


which is negative, zero, and positive when p = 1.05, 1, .95, respectively. 


3.48. Verify that curl(w(x,y),v(x,y),0) = (0,0,v x - u y ). 

3.49. Suppose ( u,v,w ) satisfies curl (u,v,w) = 0 in R 3 , i.e., 


u y = v x , u z = w x , v z = Wy. 
Prove that there is p such that Wp = ( u,v,w ): 


p x = u, Py — v, Pz = w, 
by verifying the following steps. 

(a) Define p\ = I u(s,y,z)ds , Then p\ x = u , and p\ y = v + n, where n does not 

Jo 

depend on v. 

r 

(b) Define P 2 = Pi ~ c(y,z ) where c = I n(t,z)dt. Then p 2 x = u and p 2 y = v. 

Jo 

(c) Write p 2 Z = w + m. Then m doesn’t depend on v and y. 

(d) Define P 3 = P 2 ~ f(z) where f z = m. Then p^ x = u , p^ y = v, and p^ z = w. 

3.50. A vector field F has a vector potential G when it can be expressed F = curl G. 


Show that the inverse square field - 


lixii 3 

*3 


(xi,x 2 ,x 3 ) 

"(x2+*2+x2)3/2 

(-*2, *i,0) 


has a vector potential 


(xl+xl + xj)^ x]+xl 

at all points except along the *3 axis. 


3.51. Suppose a differentiable vector function V(X,f), that is, three components 
depending on the four variables of space and time, gives the velocity of fluid parti¬ 
cles moving in R 3 , so that 

?=V(X(0,0 

dt 

for one of the particles whose position is given by X(t) at time t. 

(a) Show that the acceleration of the particle is given by 


^A=V f + CDV)V. 
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(b) Verify that an example is X(t) = C\ +t 1 C 2 , V(X, t) = t 1 (Ci - X) where Ci and 
C 2 are constant vectors. 

3.52. Show that 

((-1,0,0) xF(x 0 ,Vm, z m )AyAz + (1,0,0) X F(x 0 +Ax,y m , z m )AyAz 


+ (0, -1,0) x F(x m ,yo,z m )AxAz + (0,1,0) x F(x m ,vo +Ay,z m )AxAz 

+ (0,0,-1) X F(x m ,y m ,z 0 )AxAy + (0,0, 1) X F(x m ,y m ,z 0 +Az)AxAyj 

tends to curlF((%o,yo ? ^o) as ^» 4y, and Az tend to zero. We used this in our geo¬ 
metric description of curlF. 

3.53. Show that the sum of pressure forces over the faces of a small cube 


-f(xo,y m ,z m )(-l,0,0)AyAz -f(x 0 +Ax,y m ,z m )(l, 0,0)AyAz 
-f(x m ,y o, Zm)(0, -1,0 )AxAz - f(x m ,yo +Ay, z m )( 0,1,0 )AxAz 
-f(x m ,y m ,z 0 )(0,0,-l)AxAy-f(x m ,y m ,z 0 +Az)( 0,0,1 )AxAy 


is equal to 

-{fx(x,ym,Zm),fy(x m ,y,Zm)Jz(x m ,y m ,z))AxAyAz 

for some x between xo and xo+Ax, y between yo and yo +Ay, z between zq and 
zo -i-Az. 

3.54. Show that div(vVw) = vAu + Vu • Vv for C 2 functions u , v from R 3 to R. 

3.55. Show that A(uv) = uAv + vAu + 2Vu • Vv for C 2 functions u , v from R 3 to R. 

3.56. Prove the vector differential identities (a)-(d) listed in the text. 

3.57. Prove the vector differential identity (e) listed in the text. 

3.58. Prove the vector differential identity (f) listed in the text. 

3.59. Prove the vector differential identities (g)-(i) listed in the text. 

3.60. Let f(y, z) be a C 2 function from R 2 to R, and suppose there is a number c so 
that 

fyy + fzz ~ ~C /• 

(a) Show that the vector field F = (cf,f z , -f y ) has the property 

curlF = cF. 

(b) Show that the function /(y, z) = sin(3y + 4 z) has this property, and find the corre¬ 
sponding vector field F. 



Chapter 4 

More about differentiation 


Abstract This chapter describes applications of the derivative to methods for find¬ 
ing extreme values of functions of several variables, and to methods for approximat¬ 
ing functions of several variables by polynomials. 


4.1 Higher derivatives of functions of several variables 

In Theorem 3.10 we showed that if all the second partial derivatives of a function 
f(x,y ) are continuous on an open set in R 2 , then 

fxy — fyx- 

Analogous results hold for differentiable functions of several variables, and for par¬ 
tial derivatives of all orders. 


Definition 4.1. A function / from an open set in R^ to R is called n times 
continuously differentiable if it has all mixed partial derivatives up to order n , 
and if all n-th partial derivatives are continuous functions. 


An n times continuously differentiable function is called a C n function. 

Theorem 4.1. Let f be a C n function from an open set in R^ to R. Then two 
partial derivatives off of order less than or equal to n, in which differentiation 
with respect to each variable occurs the same number of times, are equal, 
regardless of the order in which these partial differentiations are carried out. 


This result can be proved by repeated application of Theorem 3.10. For example, 
for a C 2 function f(x,y,z ) if we hold z fixed then the theorem gives fxy = f yx . Simi¬ 
larly f yz = f zy and f zx = f xz . And, if / is C 3 , then five applications of Theorem 3.10 
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give 


fxyz ~ fxzy ~ fzxy ~ fzyx ~ fyzx — fyxz- 


Example 4.1. Let 


f(x,y,z) = x 2 yz 4 . 


Find fxzyy We can compute the partial derivatives in the given order. 


fx = 2xyz 4 , fxz = %xyz 3 > Azy = 8xz 3 , fxzyy = 0- 


Alternatively by Theorem 4.1 we can write 



Since f yy = 0, we get f yyx - = 0. 


□ 


A partial differential equation or pde relates partial derivatives of a function. We 
say that a function is a solution of the pde if it satisfies the equation. 

Example 4.2. Let u(x,y,z,t) = e“^(cosv +cosy + cos z). We show that u is a 
solution of the partial differential equation 


u t -kAu = 0, 


where A is the Laplace operator defined as Au- u xx + u yy + u zz . A brief calcu¬ 
lation shows that 


u t = —ke ^(cosv + cosy + cos z), 


Uxx = e kt (-COSX), 

Uyy = e“^(-cosy), 
Uzz = e -fa (-cosz). 


Therefore 

kAu = k6 ~ k \-cos v- cosy- cos z) = u t 
and u satisfies the equation u t -kAu = 0. 


□ 


Problems 

4.1. Compute the partial derivatives. 

(a) fxx and f zz for (x,y,z) * (0,0,0) if f(x,y,z) = (x 2 + y 2 + z 2 )~ 1/2 

(b) fxyz if fix, y, z) = xy + yz + zx 

( c ) gxxy ~ gxyx if g is three times continuously differentiable from R 3 to R 

(d) h Xj and h XjXk if h(X ) = A • X, where A = (a\ ,..., a n ) is constant and X = (x\ ,..., x n ). 


4.2. Find the partial derivatives. 


4.1 Higher derivatives of functions of several variables 
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(c) (x 3 y+y i z + Z 3 w)yy W 

(d) ^-(||X|| 2 ) for X in IT, n > 5 

OX 5 

d 2 , 

(e) ——(||X|| 2 )forXinR«, n> 5. 

OX 5 OX 3 

4.3. Let f(x,y) = (x 2 +y 2 ) 3/2 . Find f xy . 

4.4. Let /(X) = cos(A • X) where X is in R n and A is a constant vector in R n . That 
is, 

f(x\,x 2 ,...,jc w ) = cos(aiVi + CL 2 X 2 H- 1 -^ 4 ). 

(a) Show that / Xl = -rq sin(A • X). 

(b) Find the derivatives f X2X2 and f X3X2X2X4 . 

(c) Verify that f XlXl + f X2X2 + • • • + f XnXn = -||A|| 2 /. 

4.5. Define g(x,y,z, w) = Q ax+b y +c ^ +dw s where a , b , c, and J are constants. 

(a) Show that g x = ag. 

(b) Find the partial derivatives g xxww and g yyzz - 

(c) Find a relation among the constants a , b , c, and J so that g satisfies the equation 

c^xxww + £_y_yzz — 2gx^zw = 0- 

(d) Find a relation among the constants a , c, and J so that g satisfies the equation 


gxxww + Syyzz ^g — 0. 


4.6. For a linear combination of differentiable functions / and g from R n to R, with 
a and b constants, the partial derivatives satisfy 


X( a f + b S ) = a^-+b^-, 

OXk OXk OXk 


k- 1 


Show that the following formulas are true for C 4 functions /, g , and /l 

<9 2 d 2 / d 2 g 

{a) ia f +bg)=a ^ +b 


(b) 


<9 3 d 3 / 

(af + bg) = a 


dxtdx, 


d 3 g 

dxldxr 8x}dx 


+ b 


, £=l,2,...,n, fc=l,2,...,n. 


svjijXi OXjOXl OXj^OXl 

(c) (af + Z?g + c/z) Xl = af Xl + g Xl + c/z Xl 

(d) If f,g,h are C 4 functions from R 5 to R then 


{af + bg + ch) XlX3X4X2 — af XlX4X3X2 + bg X2X3X4Xl + c/t XlX3X4X2 . 
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4.7. Show that the function u(X , t) - 1 w / 2 e H x H 2 / 4? , where X is in R n and t > 0, satis¬ 
fies the equation 

= U X\X\ + 4 I" U X n X n ’ 

4.8. There are at most three distinct second partial derivatives of a C 2 function 
f(x,y ), namely / xx , / X3 ,, and How many distinct third partial derivatives could 
there be of a C 3 function f(x\, x 2 , x 3 , X4)? 


4.9. For X in R n (n > 3) and for every twice differentiable function u(X ) we define 
Au = u XxXx + u X2X2 + • • • + u XnXn . Let r = ||X||. Show that 

(a) For every constant p, (r p ) Xj = pr p ~ 2 xj , (r ^ 0). 

(b) A(r 2 ~ n ) = 0, (r*0). 

4.10. Suppose w is a C 4 function from R 2 to R. Define zlw = u xx + 

(a) Show that zlzl U — ^ XXXX 2u xxyy "f Myyyy • 

(b) Find examples of polynomial functions w(v,y) = ax 4 + Z?v 2 y 2 + cy 4 , 0, for 

which AAu is always positive, zero, negative. 

(c) Show that for (v,y) ± (0,0), A(x 2 +y 2 ) 1/2 = (x 2 +y 2 ) _1/2 . 

(d) Suppose that u(x,y) = v(r), where v is a C 2 function from R to R for r > 0 and 

r = +y 2 . Show that 

zlw = v rr + r _1 v r . 

(e) Show that for (v,y) ± (0,0), zlzl(v 2 +y 2 ) 1/2 = (x 2 + y 2 ) _3/2 . 

4.11. Suppose that w(v,y, z) is a C 2 function from R 3 to R. Define Au = u xx + u yy + 
and let r = ^/v 2 +y 2 + z 2 . 

(a) Show that for (x,y,z) ± (0,0,0), A(x 2 +y 2 + z 2 ) 1/2 = 2(x 2 +y 2 + z 3 ) _1/2 . 

(b) Suppose u(x,y,z) = v(r), where v is a C 2 function from R to R. Show that for 
r > 0 

Au - v rr + 2r -1 v r . 

(c) Show that for (x,y,z) ^ (0,0,0), zLl(v 2 +y 2 + z 2 ) 1/2 = 0. 

4.12. Let F (x,y,z,t) = (f\(x,y,z,t),f 2 (x,y,z,t),f 3 (x,y,z,t)) be a C 2 function on an 
open set in R 4 . Define 


curlF = 


d AJA d AJA d AJA 

dy dz dz dx* dx dy 


^ dfi df 2 df 3 

divF = A + A + A> 

dx dy dz 
AV = (Af u Af 2 ,Af 3 ). 


(a) Show that curl(F^) = (curlF) ? . 

(b) Show that curl curlF = VdivF -AF. 
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(c) Suppose that 


E(x,y,z,t) = ( Ei(x,y,z,t),E 2 (x,y,z,t),E 3 (x,y,z,t )) and 
B (x,y,z,t) = ( Bi(x,y,z,t),B 2 (x,y,z,t),B 3 (x,y,z,t )) 

are C 2 functions on an open set in R 4 that satisfy the following equations, called 
Maxwell equations, 

E* = curlB, B t = -curlE, divE = 0, divB = 0. 

Show that E tt = AE and B tt = A B. 

(d) Verify that the functions 

B(X, t) = (cos(y - 1), 0,0), E(X, t) = (0,0, cos(y - 1 )) 

satisfy the Maxwell equations. The fields are sketched in Figure 4.1 at one par¬ 
ticular time t at the left, and at a slightly later time at the right. 




Fig. 4.1 A sketch of the vector fields in Problem 4.12, part (d), at two different times t. 


4.2 Extrema of functions of two variables 

Just as in single variable calculus a function / from R n to R may have points in the 
domain at which it attains a local maximum or minimum. 
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Definition 4.2. Let f : D cR”^R. We say that / has a local minimum /(A) 
at a point A in D if there is an open ball in R n centered at A so that for every 
point X in D that is in the ball, 

/(X)>/( A). 

/ has a local maximum /(A) at a point A in D if there is an open ball in R n 
centered at A so that for every point X in D that is in the ball, 

/(X)</( A). 

A point that is either a local maximum or a local minimum is called a local 
extremum of / on D. We say a local maximum or local minimum is strict if 
the corresponding strict inequality /(X) > /(A) or /(X) < /(A) holds except 
for X = A. 


Recall that a differentiable function / of a single variable that has a local 
extremum at an interior point a of an interval satisfies the condition 

f\a) = 0. 

This result has the following extension to functions of several variables. 

A first derivative test. We have the following test for local extrema at interior 
points. 

Theorem 4.2. Let f be a differentiable function from D c R n to R. Suppose 
f has a local extremum /(A) at the interior point A. Then the first partial 
derivatives off are zero at A, that is, 

V/(A) = 0. 


Proof This result is an immediate consequence of the result for functions of a single 
variable. Suppose f(x \,..., x n ) has a local extremum at an interior point A of D. 
Hold all but the i- th coordinate fixed and let Xi vary. (See Figure 4.2) Since A is an 
interior point there is an interval on which the single variable function 

f(au...,ai-i,x h a i+ u...,a n ) 


has a local extremum at jq = at. At that point 


^ (A) =°. 

OXi 


Since this is true for each i - this completes the proof. 


□ 
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Fig. 4.2 Left: A graph of /, that has a local maximum at ( a\, 02 ). Center: has a local 

maximum at a\. Right: f(a\,X 2 ) has a local maximum at «2- 


Example 4.3. Consider the graphs in Figure 4.3 of 

f(x,y) = x 2 +y 2 , g(x,y) = -x 2 -y 2 , h(x,y)=y\ 

f has its smallest value at the point (0,0), and its partial derivatives f x = 2x 
and f y = 2y are zero there, g has its largest value at (0,0), and again the partial 
derivatives g x (0,0) = 0 and g^(0,0) = 0. It is also true that 

V/*(0,0) = 0 

since Wh(x,y) = (0,3 y 2 ). But h has neither a maximum nor a minimum at (0,0): 

h(0 ,y)=y 3 

is positive for some points and negative for other points arbitrarily close to 

(0,0). □ 


z = f(x,y) 




Fig. 4.3 Graphs of f,g,h in Example 4.3. 
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A second derivative test. As we saw in Example 4.3, a function / from W 1 to R 
that has V/(A) = 0 may have a local maximum, a local minimum, or neither at A. 

We turn to the second partial derivatives to test for local extrema at interior points 
of the domain. Recall the case of functions of a single variable: 

Taylor’s Theorem for a twice continuously differentiable function gives 

f"( c ) 

f(a + h) = m+f(a)h+ J —^h 2 
for some c between a and a + h. If f'(a) = 0 we get 

f"(r) 

f(a + h)-f(a)= J —^h 2 . 

Since f" is continuous and f"(a) is positive, then f" is positive at all points close 
enough to a. By taking h small enough, c will be close to a , and /"(c) > 0. In that 
case 

f(a + h) = f{a) + > f(a). 

Thus / has a strict local minimum at a. Similarly if f'(a) = 0 and f"(a) < 0, then 
/"(c) is negative for h small enough and 

f(a + h) = m+^-h 2 <f(a) 
so / has a strict local maximum at a. 

A function / from R n to R has many second partial derivatives. We arrange them 
as a matrix: 


Definition 4 . 3 . Given a C 2 function from an open set in R n to R, the n by n 
matrix of second partial derivatives at (x\ ,..., x n ) 


fx\X\ fx 1 X 2 
fx 2*1 fx 2X2 

' fx\x n 

■ fx 2X n 



fx n x 1 fx n x 2 

■ fx n x n - 


is called the Hessian matrix of / at (x \,..., x n ). 




7/ is a symmetric matrix since f XiXj = f XjXi . 

We introduce the notion of positive definite matrix to use information about the 
second partial derivatives of / at X. We begin with the two-dimensional case. 









4.2 Extrema of functions of two variables 


169 


Definition 4.4. A symmetric 2x2 matrix S, 


S = 


p q 

q r 


is called positive definite if the associated quadratic function 


S(u,v) = U SU 

is positive for all U = (u,v) ± 0 in R 2 . The matrix S is negative definite if -S 
is positive definite, and S is indefinite if U • SU has both positive and negative 
values. 


The next result is an important property of positive definite matrices. 


Theorem 4.3. Let S = 


p q 

q r 


. S is positive definite if and only if there exists a 


positive number m such that for all ( u , v) in ] 


S(u,v ) = [u v] 


p q 

u 

q r 

V 


= pu 2 + 2 quv + rv 2 > m(u 2 + v 2 ). 


(4.1) 


Proof If there is such a number m, then certainly S (m, v) > 0 for all ( u , v) ^ 0, and S 
is positive definite. 

Conversely, suppose S is a positive definite matrix. Since S ( u , v) is a polynomial 
in u and v, S is a continuous function in R 2 and in particular on the unit circle 
u 2 + v 2 = 1. By the Extreme Value Theorem, S attains a minimum m at some point 
(c, d) on this circle. The minimum m = S(c,d ) is positive since (c, d) =£ 0 and S is 
positive definite. It follows that (4.1) holds for all (m, v) on the unit circle. 

Next we show that (4.1) holds for every vector (m, v). Vector (m, v) can be written 
as a multiple of a unit vector (z, w): 

(u,v) = a(z,w) = (az,aw), a= s/u 2 + v 2 , z 2 +w 2 = l. 

S ( u , v) is a 2 times S (z, w) because 

S ( u , v) = S ( az , aw) = pa 2 z 2 + 2 qazaw + ra 2 w 2 
= a 2 (pz 2 + 2 qzw + rz 2 ) = a 2 S (z, w). 

For (z, w) on the unit circle we have shown that S (z, w) > m = m(z 2 + w 2 ). Therefore 
S' (u, v) = a 2 S (z, w) > a 2 m(z 2 + w 2 ) = m(w 2 + v 2 ), 


which proves the inequality (4.1). 


□ 
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The following result is a consequence of Theorem 4.3: 


Theorem 4.4. Suppose that a symmetric matrix S is positive definite , and for 
all U = ( u , v) in R 2 

S (u, v) = U • SU > m(u 2 + v 2 ). 

Then every symmetric matrix of the form S + T, where the elements of the 
symmetric matrix T are small enough , is positive definite and 

(S + T)(u, v) = U • (S + T)U > y (w 2 + V 2 ). 


Proof. First we show that if the entries of the symmetric matrix T are small enough, 
the associated quadratic function T(u,v) satisfies 


T(u,v) > -y(> 2 + v 2 ). 


Denote 


T = 


a b 


a b 

u 

b c 

, T(u,v) = [uv] 

b c 

V 


= au 2 + 2buv + cv 2 . 


If h > 0 we use the fact that (u + v) 2 = u 2 + 2uv + v 2 > 0 to get 2 buv > b{-u 2 - v 2 ) and 

r\ 0 0 0 0 0 

au + 2 buv + cv > au + b(-u - v ) + cv 
= (a - b)u 2 + (c - Z?)v 2 . 

If £ < 0 we use the fact that (u - v) 2 = u 2 - 2 uv + v 2 > 0 to get b(u 2 + v 2 ) < 2buv and 

aw 2 + 2buv + cv 2 > au 2 + Z?(w 2 + v 2 ) + cv 2 
= (a + £>)z/ 2 + (c + Z?)v 2 . 

Take |«|, \b\, and |c| all less than j. Then \a — b\ < M + |&| < f, so a-b > -j. Similarly 
for c - b, a + b , and c + b. Therefore when \a\, \b\, and \c\ are all less than j 

T (u, v) = au 2 +2buv + cv 2 


m / 2 2\ 

> --(w 2 + v 2 ). 


Now add the inequalities for S and T to get 

(S + T)(u , v) = ^ (u, v) + T (u, v) 


> m(i/ 2 + v 2 ) - — (u 2 + v 2 ) = — ( u 1 + v 2 ) 


for all (u,v). Since y > 0 the matrix S + T is positive definite. 
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Theorem 4.5. (Second derivative test) Suppose f is a C 2 function on the 
interior ofD c R 2 and that at some interior point ( c,d) ofD, 


Vf(c,d) = 0. 


Let P(f(c,d) be the Hessian matrix of second partial derivatives off at ( c,d), 


Wc,d) 


fxx(c,d) fxy(c,d) 
_fy X (C,d) fyy(C,d) 


(a) If*Hf(c,d) is positive definite then f has a strict local minimum at ( c,d ). 

(b) If<Hf(c,d) is negative definite then f has a strict local maximum at ( c,d ). 

(c) IfHlfic^d) is indefinite then f has neither a local maximum nor local min¬ 
imum at (c, d). 


Proof (a) Suppose Pif(c,d) is positive definite. By Theorem 4.3 


[u vfHfM 


> m(u 2 + v 2 ) 


for some m > 0. In the proof of Theorem 4.4 we showed that if the difference T 
of the matrices LHf(c,d) and Pif(x,y) has entries that have absolute value less 
than f then 

<Hf(x,y) = <Hf(c,d) + T 

is positive definite. Since / has continuous second partial derivatives we can find 
a disk of radius r centered at ( c,d ) where all these differences are within j. 
Let (v,y) be a point of the disk and let ( u,v ) be the vector from (c, d) to (x,y), 
C u , v) = (x-c,y-d). Define a function of the single variable t on an open interval 
containing 0 and 1 by 

git) = f(c + tu,d + tv). 

The function g evaluates / along the line segment from (c, d) to (v,y), with 


g(0) = f(c,d), g(l) = f(x,y). 


Differentiating with respect to t , the Chain Rule gives 

g {t) = uf x (c + tu,d + tv) + vf y (c + tu,d + tv) 


and 


g"(t) = u 2 f xx (c + tu,d + tv) + 2 uvf xy (c + tu,d + tv) + v 2 f yy (c + tu,d + tv). 


By Taylor’s Theorem for functions of a single variable we have 
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f(x,y) = g(l) = g(0) + g'(0)(D + ^(l) 2 

for some 0 between 0 and 1. Since g'(0) = uf x (c,d) + vf y (c,d) = Ou + Ov = 0 we 
have 

f(x,y) = f(c,d) + (4.2) 

g"(0) = u 2 f xx (c + 0u,d + Ov) + 2 uvfxy(c + 0u,d + Ov) + v 2 f yy {c + 0u,d + Ov) 

= \uv] < Hf(c + 0u,d + 0v) U 

where (c + 0u,d + Ov) is on the segment between ( c , d) and ( x,y ) and hence within 
distance r of (c, d). Since *Hf(c + 0u,d + Ov) is positive definite g"(0) is positive. 
By equation (4.2) we get 

f(x,y)> f(c,d). 

(b) If d) is negative definite apply part (a) to the function -/. The function / 
has a strict local maximum wherever -/ has a strict local minimum. 

(c) If 7T/(c,J) is indefinite then 

g"(0) = u 2 f xx (c, d ) + 2wv/xy(c, d) + v 2 fyy(c, d) 

is positive for some values of ( u,v) and negative for others. For r small enough, 
g"(0) is close enough to g"(0) so that g"(0) will be positive for some values of 
(, u,v) and negative for others. So for some points (x,y), f(x,y) > f(c,d) and for 
others f(x,y) < f(c , d). That is, at (c, d) the function / has neither a maximum nor 
minimum, but V/(c, d) - 0. Such a point is called a saddle point for /, a name 
dating back to the days when people rode horses. 

□ 



The next theorem gives a useful criterion for a 2 by 2 matrix to be positive definite 
or negative definite. 
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Theorem 4.6. The symmetric matrix 

s = \ pq 

q r 

is positive definite if 

p > 0 and pr-q 2 > 0. 

S is negative definite if p < 0 and pr-q 2 >0 and S is indefinite if pr-q 2 < 0. 


Proof. The quadratic function associated to S is 

S ( u , v) = pu 2 + 2 quv + rv 2 . 

For v ^ 0 


S{u,v) = v 2 {^p(ff +2cf + f 


Let /(f) = pr + 2gf + r. Then 


v 2 f(~) = S(u,v) 

Rewrite /(f) by “completing the square” 

f(t)=p 

Now f(‘f ) is positive, and therefore S(ii.v) is positive, if 


^ 2 +2-f + (-) 2 ) + r - — =p(f+-) 2 +(r- — 
P \P P \ P \ P 


p> 0 and r - y > 0, 

that is p > 0 and pr-q 2 > 0. And /(“) is negative, and therefore S ( u ,v) is negative, 
if 

a 2 

p < 0 and r - y < 0, 

that is p < 0 and pr-q 2 > 0. The same criteria apply when v = 0 because 


S/^O) = pu 2 


is positive or negative with p. 


Now consider the case pr - q 1 < 0. If p > 0 then < 0- Pick ( u , v) so that the 


ratio ^ is large enough that 




P 
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In this case /(“) > 0. Then S(u,v) > 0. 

Alternatively, pick (w,v) so that the ratio ^ - | then 

m = 0 + r- q j. 

9 a 1 

Since p > 0 and pr-q z < 0, /(**) = r - y < 0 and we get 

S( M ,v) = v 2 /(f)<0. 

Thus for p > 0 we have shown that S(u,v) is positive for some (w, v) and negative 
for other (w, v). If p < 0 use a similar argument to show that again S (u,v) is positive 
for some ( u , v) and negative for other (u, v). □ 

Example 4.4. Find the local extrema of f(x 9 y ) = x 2 + 2x + y 2 + 2. The gradient 

V/(x,y) = (2v + 2,2y) 


so V/(-l,0) = (0,0). The Hessian matrix of / at (-1,0) is 


W~ 1 , 0 ) 


' fxxi-m fxyi-m 


2 O' 

fyx(~ 1,0) /„(-!,0) 


02 


Since /„( -1,0) = 2 > 0 and f xx (- 1, 0)f yy (- 1,0) - / 2 (-1,0) = 4 > 0, <Hf{- 1,0) 
is positive definite and / has a strict local minimum at (-1,0). Since V/(x,y) 
exists at all points (x,y) of R 2 and is zero only at (-1,0), there is only one 
local extremum. □ 

Example 4.5. Find the local extrema of f(x,y ) = x 2 -y 2 . The gradient 


Vf(x,y) = (2x,-2y) 


so V/(0,0) = (0,0). The second partial derivatives of / are 


fxx(x,y ) = 2, fyy(x,y) = -2, fxy(x 9 y) = f yx (x 9 y ) = 0 . 

The matrix 

2 0 

W(0,0)= Q _ 2 

has fxx(0 9 0)fyy(0 9 0)-f^(0 9 0) = (2)(-2)-0 2 < 0, so / has a saddle at (0,0). 
See Figure 4.4 for a sketch of the graph of /. □ 

Example 4.6. Find the local extrema of f(x,y ) = 2x 2 - xy + y 4 . The gradient 
V/(v,y) = (4v-y,-v + 4y 3 ) 

is zero when x = \y and -v + 4y 3 = 0. These imply -|y + 4y 3 = 0. The solu¬ 
tions are y = 0, ± |, and since v = the gradient is zero at 
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(0,0), (16> 4)’ ( 16’ 4)’ 


The Hessian 44f(x,y) = 


4 -1 
-1 12y 2 


so 


44f (0,0) = 


4 -1 
-1 0 




4 -1 
-1 ^ 

1 16 J 


w/(-£,-*) = 


4 -1 

-1 ^ 
1 16 


Since 4(0) - (-1) 2 < 0, / has a saddle at (0,0). Since 4 and 4(j|) - (-1) 2 are 
positive, / has a local minimum at (^, |). The last two Hessians are the same, 
therefore / also has a local minimum at (- ^, -1). □ 

Example 4. 7. Consider the three functions 

f(x,y) = x 2 , x 3 , -v 2 . 


Each has V/(0,0) = 0 and Hessian 


W(0,0) 


00 

00 


44 f( 0,0) is not positive or negative definite and it is not indefinite. But for 
f(x,y ) = x 2 , f( 0,0) is a local minimum; for f(x,y ) = v 3 , /(0,0) is neither a 
maximum or a minimum; and for f(x,y ) = -x 2 , /(0,0) is a local maximum. 
In these examples the Hessian gives no information. □ 


Problems 


4.13. Let f(x,y) = (x - l) 2 + 2(y - 2) 2 + (y - 2) 3 = 1 - 2*+ v 2 + 4y - 4y 2 +y 3 . 

(a) Calculate V/ and find the two points where V/ = 0. 

(b) Calculate the Hessian matrix 44f of / at the two points in part (a). 

(c) Determine whether / has a local maximum, minimum, or saddle at the points 
you found. 

4.14. A symmetric 2 by 2 matrix A has been computed numerically with small errors 
as a symmetric matrix S, and each /, j entry of S is within 10 -3 of the /, j entry of A. 
If S (u, v) > 3 x 10 ~ 2 {u 2 + v 2 ) show that A is positive definite. 

4.15. Write the following quadratic functions in the form X • SX for some symmetric 
matrix S. 

(a) 3x 2 +4vi^2 + x 2 

(b) -x 2 + 5viX2 + 3^2 

4.16. Let f(x,y ) = x 2 + 2xy + y 3 . Determine all local extrema. 
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Fig. 4.5 The sum of distances from P to A, B, and C is minimized in Problem 4.18. 

4.17. Let f(x,y ) = -x 3 + x 2 + xy + 3 y 2 . Show that /(0,0) is a local minimum. 

4.18. Let A = (ai,< 22 ), B = and C = (c\,C 2 ) be three points in R 2 , and let 

f(x,y ) be the sum of the distances from point (v,y) to A, B, and C. 

(a) Show that if V/(P) = 0 then the sum of the unit vectors 

P-A P-B P-C 
HP-All + IIP-BII + IIP-CII 

(b) Show that the sum of three unit vectors is zero if and only if the angle between 
each pair of them is 120 degrees. See Figure 4.5. 

4.19. Find a number m so that 

x 2 + qxy + y 2 

is positive definite when \q\ < m. 

4.20. Suppose a C 2 function f(x,y ) satisfies 

fxx +fyy = 0 . 

(a) Suppose also that f xx is not zero at any point. Show that / does not have any 
local maximum or minimum values. 

(b) Suppose that at every point (v,y), at least one of the second partial derivatives 
fxx , Ay, or f yy is not zero. Show that / does not have any local maximum or 
minimum values. 

4.21. Let f(x,y ) = x 2 + 2y 2 and g(x,y) = x 4 + 2y 2 . Show that / and g attain their strict 
local minimums at the origin. Show that the matrix of second partial derivatives 
7//(0,0) is positive definite but 7/g(0,0) is not positive definite. 

4.22. Find the point on the plane 


z = x - 2y + 3 

that is closest to the origin, by finding where the square of the distance between the 
origin and a point of the plane is a local minimum. 

4.23. Confirm that the point found in Problem 4.22 is also a normal vector to the 
plane there, and make a sketch to show why this is the case. 
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4.3 Extrema of functions of several variables 


Theorems developed in Section 4.2 for functions of two variables can be extended 
to functions of several variables. 

Taylor’s Theorem Recall that a function / from an open set in R w to R is differen¬ 
tiable at A if /(X) can be well approximated by /(A) + V/(A) • (X - A), 


/(X) = /(A) + V/(A) • (X - A) + R(X - A) 

where ^ tends to zero as ||X-A|| tends to zero. Let 
l|X — A|| 

n 

Pi (X) = /(A) + V/(A) • (X - A) = /(A) + f Xi (A)( Xi - a,)- 

i= 1 

pi is called the first order Taylor approximation of / at A. Let H = X - A, then 

n 

Pi (X) = pi(A + H) = /(A) + V/( A) • H = /(A) + £ ,f Xl (A)h, 

i—\ 


Example 4.8. Write the first order Taylor approximation of 

f(x , y, z) = sin v + 2y + e yz 

at (0,0,0), and use it to approximate /(. 1, .2, .01). 

The gradient of / is 


V/(*,y,z) = (cos v^ + ze^ye 3 ^) 

and V/(0,0,0) = (1,2,0). The first order Taylor approximation of / at (0,0,0) 
is 


pi (0 + H) = pi (hi , h 2 ,h 3 ) = /(0,0,0) + V/( 0,0,0) • {h x , h 2 M) = 1 + h + 2/z 2 , 

and /(.l, .2, .01) * /?i(.l, .2, .01) = 1 + .1 + .4 = 1.5. □ 

Suppose now / from R n to R is C 3 on an open set that contains an open ball 
centered at A, and let X = A + H be a point in the ball. Define the function 

*(f) = /(A + fH) 

for t in an open interval that contains 0 and 1. For 0 < t < 1 the points A + are 
on a line segment from A to A + H, and g(0) = /(A), g(l) = /(A + H). Since g is a 
C 3 function on an open interval that contains [0,1], we can compute g\ g", and g'" 
and write the order 2 Taylor approximation of g , with the remainder. By the Chain 
Rule, 
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8 

By the Chain Rule again 

, ( n \ 


\t) = V/(A + fH)-H = £/* ; ( A + tU)hi. 


i= 1 




At t = 0, 


i ( n \ n 

X h >u A + ?H ) =Z hi Z jw a+tH)h '= Z h >^4 A + fH )^- 

*=i V7=i / Uj =i 


V i=l 


?'(0) = £/;,(A)/1„ g"(0) = £ hif XiXj (X)hj = [h\h 2 ••• h n ] 


i= 1 


ij= 1 


/W A ) 


where (A)] is the Hessian matrix of / at A. By the Chain Rule again 


«"'(*) = 2 hihjhkf^A + tH). 
ij,k= 1 

According to Taylor’s Theorem for functions of a single variable, there is a number 
6 between 0 and 1 so that 

g(D = *( 0 )+g'(0) + \g"( 0) + y\e). 

This expresses /(A + H) as the order 2 Taylor approximation , /? 2 (A + H), plus a 

hi 1 

remainder. Denote by H the column vector ; and denote by H r the row vector 

- h n _ 

(h \,..., h n ). Then we have 

/(A + H) = /(A) +V/(A)-H + iH r [/ A ,. A .(A)]H + R 2 (A,H) = p 2 (A + H)+R 2 (A,H)- 
By the triangle inequality 

n n 

|R2(A,H)| = A ^ hihjh k f XiXjXk (A + 0H) < A 2 |AiW^ t (A + eH)|. 


U,*=i 


i,j,k=l 


Each \hi\ < ||H||, so we get |R 2 (A,H)| < jil|H|| 3 |/ w *(A + 0H)|. 

iJM 1 

Since the third partial derivatives are continuous on the closed ball of radius ||H|| 
centered at A, they are bounded. Let K be the sum of the bounds of the third partial 
derivatives. Then 

|R 2 (A,H)| < j[A||H || 3 = £||H|| 3 . 
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wu n ^ l^2(A,H)| |7? 2 (A,H)| „ .a . 

When H 1 0 then 0 <--— < k HII, so--— tends to zero as H tends to 

l|H|| 2 ||H|| 2 

zero. 

For C m functions we define the order m Taylor approximation to / at A as 

n n 

pJA + H) =/(A) + ^ h H f Xi] (A) + \ ^ h h h i2 f XiiXi2 ( A) 
h =1 hJ 2=1 

n 

4 h 2 ^*2 ’ ’ ’ )A'l -x im (A)- 

! lv4 = l 

An analogous argument to the one we used to prove Taylor’s second order 
approximation can be made to prove the following theorem. 


Theorem 4.7. (Taylor ) Suppose f from R n to R is a C m+l function on an open 
ball centered at A. Then for A + H in the ball, 

n n 

/(A + H) = /(A) + 2 h h /*. (A) + \ 2 h h h i2 f Xii Xi2 (A) 

*1 = 1 * 1**2 = 1 


4 m ! i hi 2 • • • hi m )fx h xi 2 • • -Xi m (A) + Rm (A, H), 

h vj*'m = l 

where |/? m (A,H)| < &||H|| m+1 /br constant k. The remainder goes to zero 
faster than ||H|| m in the sense that 


0< 


|*m(A,H)| 

iiHir 


<*|H||. 



Fig. 4.6 Graphs for Example 4.9. Left: f and its Taylor polynomial p \. Right: /, p\, and p 2 . 


Example 4.9. Let f(x,y) = x 2 + \y 2 + |(v 4 +y 4 ). Find the first and second order 
Taylor approximations to / at (1,1). 
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/(1> 1) = 4, V/(x,y) = (2x + 3x 3 ,3y + 3y i ), V/( 1,1) = (5,6), 


W/(x,jO = 


2 +9.x 2 0 

0 3 + 9/ 


*W/(U) = 


11 0 
0 12 


So 


p 1 (l+Ai,l+A 2 ) = /(U) + V/(l,l)-(Ai,A 2 ) = 4 + 5A 1 +6A2, 
p 2 (l +*i,l + * 2 ) = px(\+hu\+h 2 ) + \[hx h 2 YHf(\, 1) 


— 4 + 5 h\ + 6*2 4/ + 6 * 2 - 

We can also write /?i and p 2 in the form 


P\(x,y) = 4 + 5(x- l) + 6(y-1), 

P 2 (x,y) = 4 + 5(x- l) + 6(y-l) + ^(x- \) 2 +6(y- 1) 2 . 

Figure 4.6 shows graphs of /, £>i, and p 2 . 

Example 4.10. Find the second order Taylor approximation to 

/(*, y, z) = sin v + 2y + e^ 


at A = (0,0,0), and use it to estimate /(. 1, .2, .01). Recall we found 
V/(v,y,z) = (cos x,2 + ze yz ,ye yz ) 


□ 


and V/(0,0,0) = (1,2,0) in Example 4.8. The second partial derivatives are 



-sinv 0 

0 


00 0 

*Hf(x,y,z) = 

0 zV z 

(l+yz)e yz 

, 91 f (0,0,0 ) = 

00 1 


0 (1 + vz)e - v - 7 

y 2 e>' z 


0 1 0 


The second order Taylor approximation to / at (0,0,0) is 


p 2 (H) = /(0) + V/(0,0,0).H+iH r 


000 
00 1 
0 1 0 


H — 1 + h\ + 2 /z 2 + /z 2 /l3, 


and p 2 (.l, .2, .01) = 1 + .1 +2(.2) + (.2)(.01) = 1.502. □ 

Extrema. Suppose / is a C 3 function from an open set in R n to R, and that the 
gradient of / is zero at a point A. By Taylor’s Theorem 4.7 

/(A + H) = / (A) + j H r [/*.*; (A)]H +R 2 (A, H) 
where |/, Pj (A)] is the Hessian matrix of second partial derivatives of / at A, and 
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|fl 2 (A,H)| < fc||H || 3 

for some number k. Since [/^(A)] is symmetric we can determine whether / has 
a strict local minimum or strict local maximum at A by considering the sign of 
H r [f XiXj (A)]H. Suppose first that the matrix is positive definite, that is 

H r [f XiXj (A)\H>m\\H\\ 2 

for some number m> 0. Then 

/(A + H)-/(A) = 5 H r \f Xi xj (A)]H + 7? 2 (A,H) > ±m||H || 2 + R 2 (A,U). 

By Taylor’s Theorem | 7 ? 2 (A,H)| < &||H|| 3 , so 

/(A + H)-/(A)> ±m||H|| 2 +tf 2 (A,H)> ±m||H|| 2 -£||H || 3 = m||H|| 2 (± - i||H||X 

Since \ - ^||H|| is positive for ||H|| small enough, we see that / has a strict local 
minimum at A. This proves the second derivative test, that we state as the following 
theorem. 


Theorem 4.8. Second derivative test. Let f be a C 3 function in an open set 
ofW 1 containing A. IfWf(A) = 0 and the Hessian matrix [/ X/X/ (A)J is positive 
definite at A, then f has a local minimum at A. 


Looking further at our proof of Theorem 4.8, we can make an observation about 
the sign of the error in the approximation /(A + H) « /(A) + V/(A) • H in the general 
case where V/(A) might or might not be 0. We state this in the following theorem. 

Theorem 4.9. Let f be a C 3 function in an open set of R n containing A. If the 
Hessian matrix |/ X/X; .(A)] is positive definite at A, then the degree one Taylor 
approximation 

/(A + H) * P1 (A + H) = /(A) + V/(A) • H 
is an underestimate for all sufficiently small H, 

/(A + H) > /?!(A + H). 


Proof By Taylor’s Theorem 

/(A + H) - Pl (A + H) = iH r [f XiXj ] H + R 2 ( A, H) 

where |7?2(A,H)| < &||H|| 3 for some k > 0. Since the Hessian is positive definite there 
is some m > 0 so that 
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/(A + H) - P1 (A + H) > im||H|| 2 - fc||H|| 3 = m||H|| 2 (*-* ||H||). 
Since ^ - ^||H|| is positive for ||H|| small enough we see that for such H, 

/(A + H) > pi (A + H) 


and p\ is an underestimate for /. 


□ 


In order to apply the second derivative test, Theorem 4.8, it is useful to have a 
criterion for determining whether a symmetric matrix is positive definite. We state 
the following result that generalizes Theorem 4.6. 


Theorem 4.10. Let M = [m^] be a symmetric matrix , nxn. Suppose 


m \\, 


det 


mu ^12 

m2i m22 


det 


mu JWi2 mi3 
m2i m22 ^23 
m3i m 32 m 33 


detM 


are a// positive. Then M is positive definite. 


For a proof of Theorem 4.10 please consult a text on matrix theory. 

Example 4.11. Let f(x,y,z) = x 2 + y 2 + z 2 + 2xyz. At which of the points 
(0,0,0), (1,1,1), (-1,-1,-1) does / have a strict local minimum? 

Vf(x, y, z) = (2x + 2 yz, 2 y + 2 xz, 2 z + 2 xy) 

so V/(0,0,0) = (0,0,0), V/(l, 1,1) = (4,4,4), V/(-l,-l,-l) = (0,0,0), Since 
V/( 1,1,1)^ 0, f (1,1,1) cannot be a local minimum. The matrices of second 
partial derivatives are 


“2 2 z 2y 


2 0 0 


_ 2 -2 -2" 

2 z 2 2x 

, 'Hf (0,0,0) = 

0 2 0 

, -1) = 

-2 2 -2 

2y 2x 2 _ 


0 0 2 


-2 -2 2 


Since U r< H/(0,0,0)U = 2||U|| 2 is positive except when U = 0, 44f( 0,0,0) is 
positive definite and /(0,0,0) is a strict local minimum. We could also use the 
determinant criterion to see that T4f( 0,0,0) is positive definite, since 2 > 0, 


det 


2 0 
02 


= 4 > 0, and det94f(0,0,0) = 8 > 0. At the point (—1,—1, — 1) we 


check for positive definiteness of T4f(- 1, — 1, — 1) by the determinant crite¬ 
rion: 


2 > 0, det 


2 -2 
-2 2 


= 4-4 = 0, det 


2 -2 -2 
-2 2 -2 
-2 -2 2 


= 2(0) + 2(—4 - 4) - 2(4 + 4) = -32. 


Since the determinant of the 2 by 2 part is zero the theorem gives no infor¬ 
mation. If we experiment we find 
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T 


T 

[10 0]7f/(-l,-l,-l) 

0 

= 2, [1 1 

1 


0 


1 


so the matrix is indefinite, and /(-l,-1,-1) is a saddle. □ 


Problems 


4.24. Let /(X) = C • X be a linear function from R n to R, and let A be a point in R n . 
Show that the first and second order Taylor approximations to / at A, p\ and p 2 , are 
equal to /. 

4.25. Let f(x,y,z ) = e J log(l +x) + sinz. 

(a) Show that / has no local extrema. 

(b) Find the second order Taylor approximation /?2(^i,^2,^3) to / at (0,0,0). 

4.26. Let 


Find the first order Taylor approximation to/atA = (^,^,^). 

4.27. Let f(x, y, z) = x 2 + xy + \y 2 + 2yz + z 3 . 

(a) Show that V/ is the zero vector at 0 and at one other point A. 

(b) Use Theorem 4.10 to show that / has a local minimum at A. 

(c) Find points arbitrarily near (0,0,0) where / is positive and where / is negative, 
to show that / has a saddle at (0,0,0). 

4.28. Consider the set S of points in (x,y,z) space that satisfy 

z 2 = x 2 + 2y 2 + l. 


Let (< a , b , 0) be a point in the z = 0 plane. Find the local extrema of a function of (x,y) 
to determine the points of S that are closest to (a,b, 0). 

4.29. Let /(X) = ||X|| _1 for X ^ 0 in R 3 , and let A be a nonzero vector in R 3 . 

(a) Find all first and second order partial derivatives of /. 

(b) Find the second order Taylor approximation to / at A. 


4.30. Use Theorem 4.10 to show that the following matrices are positive definite. 


(a) 


2 -1 
-1 1 


(b) 


2 -1 0 
-1 1 0 
0 04 
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(c) 


2 -1 0 
-1 1 k 
0 k 6 


k 2 < 3 


4.31. Let /(jti, X 2 , X 3 ,X 4 ) = x\ + X 2 + V 3 + X 4 + V 1 v 2 .x 3 .x 4 . Find the order 1 through 5 
Taylor approximations p\,P 2 ,P 3 >P 4 iP 5 for / at A = (0,0,0,0). 


4.4 Extrema on level sets 


The Extreme Value Theorem 2.11 guarantees that a continuous function / from a 
closed and bounded set D c R n to R has an absolute maximum and absolute mini¬ 
mum value. In Section 4.2 we saw that if a C 1 function / from D c R n to R has a 
maximum or minimum at an interior point A then V/(A) = 0. This “first derivative 
test” gives us a way to find candidates for extrema that occur in the interior of D. 
In this section we develop a method for identifying candidates for extrema of a C 1 
function / on a level set that has no interior points in R n . We state and prove the the¬ 
orem for functions from R 3 to R but an analogous result is true in R 2 and in higher 
dimensions. 

Recall that a value /(P) is a local extremum of / on a set S , where P is in S , if 
/(P) > /(Q) for all Q in S near P, or if /(P) < /(Q) for all Q in S near P. 

Theorem 4.11. Lagrange multiplier. Let f and g be C 1 functions from an 
open set in R 3 to R, and denote by S a level set g(x,y,z) = c. Let P be a point 
of S such that 

(a) Vg(P) ± 0 , and 

(b) f has a local extremum on S at P. 

Then there is a number A such that 

V/(P) = TVg(P). 


Proof By the Implicit Function Theorem it follows from (a) that there is a portion 
of S containing P where one of the variables (x,y,z) can be expressed as a function 
of the other two, say g z (P) ^ 0 and z = <f>(x,y). Then 

g(x,y,<p(x,y)) = c. 

Since / has a local extreme value at P = (pi,P 2 ,P 3 ), the function 

h(x,y) = f(x,y,(p(x,y)) 

has a local extremum at the center of an open disk about (p\,p 2 )- Therefore the 
derivatives of h(x,y) with respect to v and y are zero at (pi,p 2 )’ 
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df dfdf 
h x — —— I - ———— — 0, 
dx dz dx 

_dj_dj_df_ 
y dy dz dy 

We have by differentiating g(x,y,0(x,y)) 

= c that 

^ + -0 
dx dz dx 

-o 

dy dz dy 

Therefore, using g z 4 0, 


n . d 8 

# _ _dx_ 

df _ dy 

dx~ §8 9 
dz 

dy ~ <>&' 

dz 


Substitute these formulas for the partial derivatives of 0 at (puP 2 ) into formula (4.3) 
for h x and h y . We get at P, 


df + df 
dx dz 


^6 

dx 

= o, d -i + d -L 

_dy_ 

dg 
dz l 

dy dz 

dg 

V dz J 


df 

Let A = -^-(P). Then V/(P) = TVg(P) as asserted. □ 

dz 


f = 3 . 



Example 4.12. Find the rectangular box of largest volume that has sides par¬ 
allel to the x,y and a axes and that can be inscribed in the ellipsoid given 
by 

5x 2 + 3y 2 + 7z 2 = 1. 

We maximize the volume, f(x,y,z ) = (2x)(2y)(2z) = 8 xyz 9 for (x,y,z) in the 
level set g(x,y,z ) = 5x 2 + 3 y 2 + Iz 2 -1=0, where ( x,y 9 z ) is in the first octant. 
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The equation g = 0 is often called a constraint. It is possible in this problem 
to solve the constraint g = 0 for z in terms of ( x,y ) and use the methods of 
Section 4.2. But let’s use the Lagrange multiplier method. We solve the system 
of equations 

5x 2 + 3y 2 + 7~ 2 -1=0, V/ = (8 yz, 8 xz, 8xy) = AVg = A( 1 Ox, 6y, 14 z) 

by considering cases. Either A = 0 or not. If A = 0 then at least two of the 

coordinates x,y,z are zero, so the volume / is zero. So assume A ± 0. We 

consider only points where x,y,z are all nonzero since we seek a maximum 
volume. Then we can form quotients; we find 

8 yz y T(lCbc) 5x 8 yz z T(10v) 5x 

8 vz v A(6y) 3y’ 8 xy x A(14z) lz ’ 

so y 2 = |x 2 and z 2 = jx 2 . The constraint g = 0 then gives 


5a 2 + 3j 2 + 7z 2 -1 = 5a 2 + 3(§a 2 ) + 7(§x 2 )-1 = 15x 2 -l =0. 


This gives as candidates for the maximum, the eight points with coordinates 


V3V5’ ^ V3V3’ V3V7' 


Since the domain of / is the first octant we take all the plus signs. The max¬ 


imum volume of the box is then /( 


J 7= , J 7= , -p-p) = —for a box that 
V3V5’ V3V3’ V3V7 7 9V35 


measures 


by 


V3V5 J V3V3 


Example 4.13. Let Q = 


by 

P 4 
q r 


V3V7' 


and consider the quadratic function 


G(X) = X • QX = [x y] 


P 4 

X 

q r 

y_ 


= px 2 + 2 qxy + ry 2 


on the unit circle ||X|| 2 - 1 = x 2 + y 2 - 1 =0. Since Q is continuous and the 
circle is closed and bounded, Q has a maximum on the circle. If X is a point 
where Q is maximum then according to Theorem 4.1 

V0(X) = 4V(||X|| 2 ). 


or (2px+2qy, 2qx+2ry) = A(2x, 2y). After dividing by 2, this is neatly expressed 
as 

QX = AX. 

Such a number A is known as an eigenvalue of the matrix Q, and vector X is 
a corresponding eigenvector. Dot the equation QX = AX with X to get 
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X • QX = /l||X|| 2 = A, 

which shows that this eigenvalue is the maximum value of the quadratic func¬ 
tion on the circle. 

This example also shows that every 2 by 2 symmetric matrix has an eigen¬ 
value. □ 

Example 4.14. Suppose there are three commodities A, B, C that have unit 
prices p,q,r , and are purchased in amounts x,y,z 9 respectively. The budget or 
“wealth constraint” is 

px + qy + rz = w (4.5) 

where w > 0 is given. Let U(x,y,z ) = x a y b z c be the utility function measuring 
consumer satisfaction from consuming x amount of A, y of B, and z of C, 
where a,b , and c are positive. By Theorem 4.11 the maximum of U given the 
wealth constraint satisfies 


ax a -'y b yf = Ap 
bx a y h ~ 1 z c = Aq 
cx a y b z c ~ l = Ar 

w = px + qy + rz. 


If A = 0 then at least one of x,y or z is 0 and the utility is zero. If A 4 0 multiply 

x y z 

the first three equations by -, -, -, respectively, to get 

a b c 


av : 


■ A-p = A-q = A-r. 
a b c 


Divide by A to get 


x y z 

~P = uV = ~ r ' 

a b c 


(4.6) 


So yq- |xp and zr - ^xp. Substituting this into the wealth constraint (4.5) 
we get 

w = px( 1 + - + -), 
v a a / 


so that px = w- 


a + b + c 
A, B, C are according to (4.6) 


. The amounts the consumer spends on commodities 


px = 


aw 


a + b + c ’ 


qy = 


bw 


cw 


a + b + c ’ 


rz = 


a + b + c 


respectively. □ 

Example 4.15. Suppose there are N particles each having one of the energies 


e\, £2, 
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Let Xi denote the number of particles with energy C[. The number of ways this 
can be arranged is known to be 

TI7 (-M + X2 4- ^-X m )\ 

Vv =- 

X\\X2\'''X m \ 

since x\ + • • • + x m = N is the number of particles. In physics one wants to 
maximize W assuming constant total energy 


g(xi,...,x m ) = eixi+e 2 X 2 + --- + e m x m = E. 


We maximize log W subject to the same constraint and use Stirling’s approxi¬ 
mation log(v!) ~ xlogx to approximate log W as 


fix i,...,x m ) = (xi + --- + x m )log(xi+--- + x m )-xi log x\ - x m log x m . 


By the ^-dimensional version of Theorem 4.11 a local extreme value of / 
occurs at a point X that satisfies 

V/(X) = (logOi + --- + v m )-logvi,...,log(vi + --- + x m )-logv m ] 


= dVg(X) = A(e \,..., e m ). 

This gives log(xi + • • • + x m ) - lo gXi = Aei , or Xi = e~ Aei . In the case 
where (x\ + • • • + x m ) = N we get 


Xi = Nt~ Aei . 

In applications to statistical mechanics, it turns out that the multiplier A is the 
reciprocal temperature! □ 


Problems 

4 . 32 . In calculus we learn that the farmer with 400 meters of fencing material can 
maximize the area of his rectangular field by making it square, 100 meters on each 
side. Use Lagrange multipliers to verify that result. 

4 . 33 . Show using Theorem 4.11 that the maximum value of the function 

Q(x,y) = 3x 2 + 2xy + 3y 2 
on the unit circle x 2 +y 2 = 1 is 4. 

4 . 34 . Use the Lagrange multiplier method to find the point of the line y = mx + b 
that is closest to (0,0). 
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4.35. Use the Lagrange multiplier method to find the point of the hyperplane 

C-X = 0 

that is closest to a given point A in R n . 



4.36. Differentiable functions / and g in the first quadrant have some level sets as 
shown in Figure 4.8. Determine from the figure: 

(a) The minimum of / on the curve g = 25. 

(b) The sign of the Lagrange multiplier A in (a). 

(c) The minimum of g on the curve f - 80. 

(d) The maximum of / on the curve g — 27. Does the equation V/ = AVg apply here? 


4.37. Let A = 


1 2 
2 -2 


. Express the quadratic function and constraint 


X AX, X X=1 


at point X = (x,y), in terms of x and y. Solve a Lagrange multiplier problem to find 
the constrained maximum of the quadratic function and express the results as an 
eigenvalue equation 


AX = AX. 


4.38. Use the Lagrange multiplier method to find the points of the curve 

x 3 +y 2 = 1 

that are closest to the point (-1,0). See Figure 4.9. 
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Fig. 4.9 The curve in Problem 4.38. 

4.39. Use the Lagrange multiplier method to find points of the level set 

x 3 +/ + z 2 = 1 

that are closest to the point (-1,0,0). 

4.40. Let f(x,y,z ) = x 2 y 3 z 4 - Use the Lagrange multiplier method to find the maxi¬ 
mum value of / on the plane 12v + 12_y + 24z = 1. 




Chapter 5 

Applications to motion 


Abstract The concepts and techniques of calculus are indispensable for the descrip¬ 
tion and study of dynamics, the science of motion in space under the action 
of forces. Both were created by Isaac Newton in the late seventeenth century and they 
revolutionized both mathematics and physics. In this chapter we describe the basic 
concepts and laws of the dynamics of point masses and deduce some of their 
mathematical consequences. 


5.1 Motion in space 


There are of course no point masses in nature—each body has a nonzero size—but 
in many situations a small body can be very well approximated by a point. Small 
here is a relative term; for instance in the study of the motion of planets around the 
sun the earth is regarded as a point mass. 

The position of a point in three-dimensional space is described by its three Carte¬ 
sian coordinates x,y, and z. These coordinates form a vector X in R 3 . When the point 
is moving, its position is described as a vector valued function of time, denoted 
X(0 = (x(f),y(t),z(f)). The function X parametrizes the curve on which the particle 
moves. Informally we may speak of the motion, or curve, X(t). 

As we saw in Section 3.1 the derivative with respect to time of the position func¬ 
tion is the velocity of the particle at time it is denoted 


V(f) = 


dX(Q 
d t 


x'(t). 


The norm of the velocity, ||V(f)||, is the speed of the particle. 

Example 5.7. Let X(t) = (cos t, sin t, t), -1 < t < 2n be the position of a particle 
at time t. Find the position, velocity, and speed of the particle at t = 0 and at 
t= |. 

V(t) = X'(t) = (-sin£,cos£, 1). 
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At t = 0 the particle is at X(0) = (1,0,0) and its velocity is V(0) = (0,1,1). At 
f = 5 the particle is at X(|) = (^=, ^=, §) and Y(f) = (- 9- The speed 

at time t is 

||V(0ll = ^/(-sin^ + Ccostp + l 2 = VI 

The particle moves with constant speed along the curve shown in Figure 5.1. 
□ 



Fig. 5.1 The curve in Example 5.1 showing position and velocity at times t- 0 and t — J. 


The derivative with respect to t of the velocity function is the acceleration of the 
particle at time t\ it is denoted 


dV(0 _ d 2 X(0 _ 
d t d t 2 


Example 5.2. Suppose a particle moves along a C 2 curve X(t) so that the speed 
is constant. We show that the velocity and acceleration are orthogonal to each 
other at each time t. Since the speed is constant we have ||X'(0ll = c , Squaring 
both sides we get ||X'(0ll 2 = X'(0 • X'(t) = c 2 . By the product rule 

d d(c 2 ) 

-(X'(t)-X'(t)) = X'(t)-X"(t) + X"(t)-X / (t) = 2X'(t)-X"(t)= =0. 

at at 


so X'(t) and X"(?) are orthogonal when the speed is constant. □ 

Newtonian mechanics. The basic notions of mechanics are velocity, acceleration, 
mass, and force. Force is a vector quantity, denoted as F. A force F, acting on a 
particle with mass m, accelerates it according to Newton’s law: 


In symbols 


Force equals mass times acceleration. 


d 2 X 


df 2 


(5.1) 


Equation (5.1) is called the equation of motion, Newton's Law of Motion. 
Next we prove the following important result. 
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Theorem 5.1. Suppose a particle moves in R 3 with position X in a field of 
force F(X) that is parallel to X, i.e., of the form 

F(X) = k(X)X 

where k(X) is a scalar valued function. Then the particle travels in a plane. 


Proof. First we show that for a particle moving in a force field that is parallel to X, 
the cross product of X(t) and X f (t) is a constant vector. Consider (X xX')'. By the 
product rule for cross products 

(XxX')' = X'xX'+XxX". (5.2) 


Since F(X) = &(X)X we get from Newton’s Law of Motion 


X" 


m 


Set this expression into equation (5.2). Since the cross product of a vector with itself 
is zero we get 


(XxX')' = Xx 



k(X) 


XxX 


m 


0. 


Therefore X x X' is a constant vector C. Since the cross product is orthogonal to 
each factor, 

C-X(0 = 0 


for all times t. If C ^ 0 the equation C • X = 0 is the equation of a plane through 
the origin, so the motion takes place in this plane. If C = 0 then X(r) and X'(t) are 
parallel to each other at each t and the particle moves along a line. Or X'(f) = 0 and 
the particle does not move. In any case the particle remains in a plane. □ 


Problems 

5.1. Find the velocity, the acceleration, and the speed of particles with the following 
positions. 

(a) X(0 = C constant, 

(b) X(f) = (*,*,*), 

(c) X(f) = (l-f,2-f,3 + f), 

(d) X(t) = (t,2t,3t), 

(e) X(t) = (t,t 2 ,t 3 ). 

5.2. A particle moves on a line. Let X(t) be its position at time t. 
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(a) Suppose X(t) = A + for some constant vectors A and B A 0. What is the loca¬ 
tion of the particle at time t- 0? 

(b) Find the velocity and acceleration at time t of the particle in part (a). 

(c) Suppose instead X(t) = A + (t 3 - t)B. Find the velocity and acceleration at time t. 
When is the speed of the particle zero? 

(d) At what times is the particle of part (c) located at X(0)? 

5.3. A particle moves on a circle of radius r in the plane according to 

X(t) = (rcos(ojt),rsm(ojt)) 


where r and co are constant. 

(a) Find the velocity and show that it is orthogonal to X(t) 9 hence tangent to the 
circle. 

(b) Show that the speed is no. 

(c) Find the acceleration and show that it is parallel to X(t) but directed toward the 
origin. 

(d) Show that the magnitude of acceleration is no 2 . 

5.4. A particle of unit mass moves in a gradient force field, so that 

X" = -Vp(X) 

for some function p. Show that the energy 

±|X'(f)ll 2 +MX(0) 


does not change with time. 

5.5. A particle moves with nonzero velocity along a differentiable curve X(t) on a 
sphere centered at the origin, so that \\X(t)\\ is constant. Show that its velocity is 
tangent to the sphere. 

5.6. Give an example of a particle motion X(7) = (x(£),y(0), a < t < b, on the circle 
x 2 +y 2 = 4, such that the acceleration is not toward the origin. 

5.7. Let A, B, K, and F be constant vectors in R 3 and let m be a number. Verify that 
the function 

X(t) = A + Br + Kt 2 

satisfies the equation of motion F = mX" when K = ^F. Find the relation of A and 
B to the initial position and velocity X(0), X'(0). 

5.8. Verify that the following functions satisfy the indicated equations of motion in 
R 3 . A and B denote constant vectors. 

(a) X(f) = A cos t is a solution of X" = -X, 

(b) X(0 = Asin(2f) is a solution of X" = -4X, 

(c) X(t) = Acos(30 + Bsin(30 is a solution of ^X" = -X. 
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5.9. For each motion X(7) given in Problem 5.8, describe a plane (See Theorem 5.1) 
in which the motion occurs. 

5.10. Consider the equation of motion 

X" = MX, 

where M is a constant matrix. Suppose there is a constant vector U and number co 
such that 

MU = -a? U. 

Show that X(t) = cos(cot)U is one solution of the equation of motion. 

Such a number -oj 2 is known as an eigenvalue of the matrix M and vector U is a 
corresponding eigenvector. 

1 — Q~ kt 

5.11. Verify that X(t) = C +- D, where k is a positive constant and C, D are 

k 

constant vectors, satisfies the equation of motion 

X" - -kX'. 

Find the relation of C and D to the initial position and velocity X(0), X'(0). Find the 
total displacement of the particle 


limX(0-X(0) 


in terms of the initial velocity. 

5.12. An ancient society once believed that the moon moved on a circle about the 
earth, being pulled across the sky by horses. In view of Newton’s law, and of Prob¬ 
lem 5.3 part (c), in which direction is the moon really being pulled? 

5.13. A particle moves in R 3 with velocity V = (vi,V 2 ,V 3 ), and acceleration 

V' = VxB, 

where B is some vector field. For example, positively charged particles follow such 
a rule when B is a magnetic field. Suppose B is a constant field, B = (0,0 ,b). See 
Figure 5.2. 

(a) Show that the components of acceleration are given by 

v[ = bv 2 
v' 2 = -bv\ 

V 3 = 0 (5.3) 

(b) Show that the position function X(0 = (asin(a>t),acos(cL>t),bt) satisfies equations 
(5.3) and that X"(0 = X'(f) xB. 

(c) Describe the direction of V'(0 at points along the helical curve X(t) given in 
part (b). 
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Fig. 5.2 Particle motions in Problem 5.13. 


5.14. Suppose a particle of mass m with position X(Y) at time t is joined by a spring 
to another particle of mass m with position Y (t). Then the equations of motion are 


mX" =k( Y-X), 
mY" = —k(Y - X) 

where k is a constant depending on the strength of the spring. Denote by W the 
vector W = (X, Y) = (x\,X 2 ,X 3 ,y\,y 2 ,y 3 ) with 6 components. Define the function p 
on R 6 by 

p(W) = p(X,Y)=i*||X-Y|| 2 . 

(a) Show that the equations of motion can be expressed in the form 

mW"(t) = -Vp(W(0) 

where Vp(W) is the vector of partial derivatives of p with respect to the 6 vari¬ 
ables of W. 

(b) Show that the quantity 

im||W'(f)H 2 + KW(0) 

does not change with time. 


5.2 Planetary motion 


There is a large variety of force fields in nature; gravitational fields are an important 
case. Take in particular the motion of a particle of mass m in the gravitational field 
of a unit mass located at the origin. 

According to the law of gravitation, the force F exerted by a particle of unit mass 
at the origin on a particle of mass m located at the point X is directed from X to the 
origin, and its strength is m times the reciprocal of the square of the distance of X 
from the origin times some constant. To simplify the model we take the gravitational 
constant to be 1. That is, 
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F(X) = -m 


1 X 

iixfixl 


X 


= —m 


IIXII 3 


For this choice of force Newton’s law of motion reads 


mX" = —m --. (5.4) 

IIXII 3 

Notice that mass m occurs on both sides of (5.4) as a factor; dividing both sides by 
m gives an equation of motion 


X" 


X 

iixif 


(5.5) 


That the motion under the force of gravity does not depend on the mass of the 
moving object is intuitively clear, for if we imagine the moving particle cut into 
two parts, the trajectory of each part is the same, although the two parts may have 
different masses. So from now on we take the mass m of the moving particle to be 1. 

Next we observe that the term on the right side of equation (5.5) is a gradient: 


X 



= Vp, 


where p(X) = 



(5.6) 


So we can rewrite equation (5.5) as 


X" = Vp(X). (5.7) 

Conservation of energy. We turn now to studying the motion of a particle of unit 
mass in a force field that is the gradient of a differentiable function p. We have seen 
that the gravitational field of a point mass is such a field. Since the sum of gradients 
is a gradient, it follows that the gravitational field of any distribution of point masses 
is a gradient field. 

We derive now an important property of solutions of equations of the form (5.7) 
where p is a differentiable function. Take the dot product of both sides with X'; we 
get 

X"-X' = V/?(X)-X'. 

The left side is the derivative with respect to t of ^X' • X'. The right side is, by the 
Chain Rule, the t derivative of p(X(t)). Since their derivatives are equal, ^X' • X' 
and p(X) differ by a constant E independent of t : 

\X'-X'-p(X) = E. (5.8) 

The first term ^X' -X' in (5.8) is called the kinetic energy of the moving particle; 
the second term -p(X) is called the potential energy; their sum is the total energy of 
the moving particle. Equation (5.8) says that the total energy of the particle remains 
the same during motion. This is the Law of Conservation of Energy. 
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Thus we have shown that for the motion of a point mass in a gradient force field, 
conservation of energy is a consequence of Newton’s law of motion. 

Kepler’s Laws. We return to equation (5.5), governing the motion of a particle 
under the gravitational force of a unit mass at the origin. 

Since by Theorem 5.1 the motion occurs in a plane through the origin, it sim¬ 
plifies our calculations to choose coordinates in which that plane is the x,y plane. 
Therefore we set z = 0 in equations (5.5), giving 

x" + ^ = 0, /' + ^ = 0, r = (5.9) 

The energy equation (5.8) can be written as 

\((x') 2 + (y') 2 )-—E==E. (5.10) 

yx 2 + y 2 

where E is constant. We obtain another relation between x,y,x', and y f by multiply¬ 
ing the first equation in (5.9) by y, the second equation by x and taking the difference 
of the two; we get 

xy" -yx" = 0. 

Since xy" - yx" is the t derivative of xy' - yx', we conclude that xy' - yx' is a con¬ 
stant, call it A: 

xy' - yx' = A. (5.11) 

We introduce polar coordinates: 

x = rcos0, y = rsin0. (5.12) 

Differentiating these equations with respect to t gives 

x' = r'cos0-r0'sin0, y' = r'sin0 + r0'cos0. (5.13) 

A brief calculation using formulas (5.13) shows that 

(x') 2 + (y') 2 = (r') 2 + r 2 (0') 2 . 

Setting formulas (5.12), (5.13) into equation (5.11) gives 

xy' - yx' = r 2 0' = A. 

So we can express 0' as 

0 '= 4 - ( 5 - i4 > 

r z 

When A is zero, 0 is a constant, so the motion takes place along a straight line. Such 
a one-dimensional motion is uninteresting, so we take the case that A is nonzero. In 
fact we may assume A > 0 (see Problem 5.19). The energy equation (5.10) becomes 
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(r') 2 + r 2 (4>') 2 - - = 2 E. 

r 


Using (5.14) for 0' in the equation above we get 


(r') 2 + ^-- 
r l r 


2 E. 


(5.15) 


Both r and 0 are functions of t. It follows from (5.14) that 0' is of one sign; it follows 
that 0 is a monotonic function of t. Therefore we can express r as a function of 0. 
By the Chain Rule 


r 


dr dr , 
d t d 0^ 


Using (5.14) to express 0' as — we get 

yl 


, dr A 
d0 r 2 

Set this expression of r' into the left side of equation (5.15); we obtain 

1 dr \ 2 A 2 A 2 2 

— I-h-— — 2 E. 

d 0 / r 4 r 2 r 


(5.16) 


Since A is not zero, we can multiply this equation by —. We get 

A z 


dr 


2r 3 


+r ~^ =2E ^- 


We introduce the abbreviations 


2 E 1 


and rewrite the equation (5.17) as 


(5.17) 


(5.18) 


-I = ar 4 + Ibr* - r 2 
d 0 


(5.19) 


dr d0 

The derivatives — and — are reciprocals of each other. (See Problem 5.20.) Taking 
d0 dr 

the reciprocal of both sides of (5.19) we get 


w 2 i 

\ dr / ar 4 + 2 br 3 — r 2 


(5.20) 
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We introduce u = - as new variable. By the Chain Rule 
r 


d0 d0 d u d(p ( 1 
dr d u dr du\ r 2 


Use this to express the left side of (5.20), and multiply both sides by r 4 ; we get 


1 


ar 4 + 2 br 2 -r 2 a + 2 bu - u 2 


Taking the square root we get 


1 


d 0 _ 

Va + 2 bu - u 2 


(5.21) 


To determine 0 as a function of u we have to find the integral of the function of u on 
the right in (5.21). We recall that the derivative of the inverse sine function is 


— sin 1 y = 
dy 


1 


VT 


Set y = p + qu, p and q constants, q > 0. We get 

a 


d . _i q 1 

— (sm (p + qu))= = = 9 

tJi -(p + qu ) 2 V<2 + 2bu - u 2 


where a and b are 


1 -P 2 


b- 


P 

q' 


(5.22) 


This shows that we may take 0 (m) in (5.21) to be 

0(m) = sin _1 (/? + ^w), 

where p and g are related to a and b by (5.22). Therefore 

sin0 = p + qu. 

We omit a brief calculation with (5.22) that confirms q is positive except perhaps for 
circular orbits. We guide you in Problem 5.15 to discover all circular orbits. 

We recall that u is -; multiplying the equation above by r we get 


rsin(/) = pr + q. (5.23) 

We claim that this is the equation of a conic in polar coordinates. The equation is 
easily rewritten in Cartesian coordinates x and y: 
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y — p -\jx 2 +y 2 + q. (5.24) 

Hence 

p 2 (x 2 +y 2 ) = (y-q) 2 . (5.25) 

It follows from equation (5.18) that b is positive. By equation (5.22), b = - — , 

Q 

therefore p is negative, in particular p is nonzero. This shows that equation (5.25) is 
quadratic in x and y, hence its zero set is a conic. 

In Problems 5.21 and 5.22 we ask you to discover that the conic is an ellipse 
when p <- 1, parabola when p = - 1, and hyperbola when -1 < p < 0. Figure 7.20 
is a sketch of the elliptical orbit of the moon around the earth. 

Kepler based his laws on observations of the planets. His first law states that 
the orbits of planets in the solar system are conics with one of the foci at the sun. 
The calculation above shows that Kepler’s first law is a consequence of the inverse 
square law of gravitational force. Several of the Problems discuss Kepler’s other 
laws. 

Newton showed that all of Kepler’s laws are consequences of the inverse square 
law of gravitational force. This striking result led to the universal acceptance of the 
inverse square law of gravitation. 


Problems 


5.15. Consider the circular orbit defined by 

X(t) = (x(t),y(t)) = (a cos cot, a sin cot) 
where a and co are constants. 

(a) Show that the radius function r = a/x 2 +y 2 is the constant a. 

(b) Show that if co 2 a 3 = 1 then X is a solution of equations (5.9): 

(x 2 +y 2 ) 3 / 2 ’ ^ (x 2 +y 2 ) 3 / 2 

(c) Show that the constant A defined in equation (5.11) is equal to coa 2 . 

Remark. The constancy of co 2 a 3 is one case of Kepler’s laws shown in this problem 
for circular orbits, and generally in Problem 5.21. An example is the prediction of 
the orbit radius of Saturn: 

(1 Earth orbit radius) 3 (a Earth orbit radii) 3 

(1 Earth year) 2 (29.5 Earth year) 2 

that gives ^satum = 9.5, so that Saturn is 9.5 times as far from the sun as Earth is. 
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5.16. We ask you to verify that the intersection of a circular cone with a plane is 
an ellipse. There are two spheres tangent to the cone and the plane, see Figure 5.3. 
Points A, B , D , and F are colinear. The points of tangency C and E of the spheres 
with the plane are the foci of the ellipse. To verify this show the following proposi¬ 
tions. 

(a) Segments BD and DC have the same length. 

(b) Segments FD and DE have the same length. 

(c) The sum of the lengths of BD and FD is the same for every point D of the 
intersection. 

(d) The sum of the lengths of DC and DE is the same for every point D of the 
intersection. 



Fig. 5.3 A cone, two spheres, and an ellipse in Problem 5.16. 


5.17. One of Kepler’s laws is that a planet “sweeps” equal areas in equal times, that 
is, the rate of change of area suggested in Figure 5.4 is independent of the time. 
Explain the following items that prove this. 

(a) If the planet is located at U = (x(t),y(t)) at time t , then its location at time t + h is 
approximately W = ( x(t ) + x'(t)h,y(t)+y'(t)h). 

(b) The signed area of the ordered triangle OUW is |(xy' -yxf). 

(c) The rate of change of area is \A, where A is the constant in equation (5.11). 

(d) If the orbit is a closed loop, deduce from part (c) that the area enclosed is \A 
times the period T of the orbit. 


, is denoted in physics texts as 


5.18. Newton’s law (5.4), mX" = —m 


X 
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Fig. 5.4 Approximate area swept by a planet in time h, for Kepler’s law in Problem 5.17. 


r 

mr" = —in MG —-. 

|r| 3 

Define X(t) = r (kt), where the constant k is to be chosen. Use the Chain Rule to find 
k so that the equation becomes (5.4), that is, we can rescale time to eliminate MG. 

5.19. Suppose ( x,y ) = ( fit), git )) is a solution of the equations of motion (5.9) with 
xy' -yx' negative. Show that 


(x,y) = (-mg(t)) 


defines another solution with xy' -yx' positive. 


5.20. Use equation (5.16) to show that there are at most two values of r at which 
dr 

— could be zero. Deduce that away from those two values we can solve for 0 as a 
d(p 

dr dd) 

function of r and there — and — are reciprocals. 
d(p dr 


5.21. Writing the orbit curve (5.24) in the form -pr = q — y and taking the case 
p <— 1, prove the following statements. 

(a) The motion occurs in the region where y < q. 

(b) There are two points of the orbit located on the y axis. Deduce that this is an 
elliptical orbit. 

(c) Show that the semimajor axis is —-and the semiminor is . 

P ~ 1 aJp 2 -l 

(d) Deduce that the semiminor axis is A times the square root of the semimajor axis. 

(e) The area of an ellipse is n times the product of the semimajor and semiminor 
axes. Deduce from part (d) and from part (d) of Problem 5.17 that 

\AT = 7tA( semimajor axis) 3/2 . 

Hence deduce Kepler’s third law : the square of the period of the orbit is propor¬ 
tional to the cube of the semimajor axis. 


5.22. Writing the orbit curve (5.24) in the form —pr = q — y prove the following 
statements. 
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(a) The motion occurs in the region where y < q. 

(b) If p = - 1 the orbit is a parabola. 

(c) If -1 < p < 0 the orbit is half a hyperbola. 

5.23. We’ve seen in equation (5.14) that motion toward a gravitating mass is possi¬ 
ble along a line, and in fact that is the only trajectory on which a collision can occur 
(with a point mass). Consider now an inverse fifth power law, 

x" = -iixr 6 x. 

(a) Show that a potential function is ^||X|| —4 , so that j||X'|| 2 - ^ ||X|| —4 is conserved. 

(b) Let a be a positive number. Suppose a semicircular trajectory is expressed as 

X(t) = (a + a cos 0(0, a sin 0(0). 

Show that -a 2 {0') 2 --—--- is constant. 

2 16a 4 (l + cos0) 2 

(c) Take the case of zero constant in part (b), and 0' positive. Find the constant k 
such that 

0'(t) =---. 

w 1+ cos 9{t) 

(d) Multiply the equation above by 1 + cos 0 and integrate with respect to t. Assuming 
0(0) = 0, deduce that 0(0 + sin 0(0 = kt. Sketch a graph of /(0) = 0 + sin 0 as a 
function of 0 to show that there is a well-defined function 0(0, 0 < t < | with 
this property. 

(e) Show that there are trajectories that are not straight lines that tend to the origin 
in finite time. 


5.24. Suppose N masses mi,.. .,m^ are attracted to each other by gravity and have 
position functions Xi(0,.. .,X^(t). Newton’s laws give 


X 


// 

k 


N 

- E 

j*K 7=1 


mjG 


Xft-Xj 

l|X*-X/' 


The center of mass of the system is C(0 = 
of the center of mass is zero: C"(0 = 0. 


x~i N 

Zj=i m J 


Show that the acceleration 


5.25. Suppose X(0 is a solution of Newton’s law X" = -||X||~ 3 X. Set Y(0 = aX(bt), 
where a and b are nonzero numbers. Show that Y is also a solution of the same 
equation, that is, Y" = —||Y|| -3 Y, provided that a 3 b 2 = 1. (This includes the case 
(a,b) = (1,-1), where the solution is run backward, X(-t).) 


5.26. Let Xj be solutions of the system of Newton’s laws in Problem 5.24, and set 
Y j(t) = aXj{bt), where a and b are nonzero numbers. Show that the Y j are also 
solutions of the same system provided that a 3 b 2 = 1. 






Chapter 6 

Integration 


Abstract In this chapter we introduce the concept of the multiple integral—the 
precise mathematical expression for finding the total amount of a quantity in a region 
in the plane or in space. Examples include area, volume, the total mass of a body, 
the total electrical charge in a region, or total population of a country. 


6.1 Introduction to area, volume, and integral 

Examples of integrals We introduce the concept of the integral of a function of 
two or more variables through two problems. 

Mass. Let D be a set in space. Let f(x,y,z ) [mass/volume] be the density at the 
point (x,y,z) of some material distributed in £>. How can we find the total mass 
M(/,£>) contained in £>? 

If the density is between lower and upper bounds t and u then the total mass is 
between the bounds 

lWo\(D) < M(f,D) < uVol(D) 

where Vol (D) denotes the volume of £>. This estimate for the mass in D is a good 
start. But perhaps we can do better. Split D into two subsets D\ and £> 2 . On each 
one / has a lower and upper bound, 

£\ < f(x,y,z) < u\ in £> 1 , 

h < f(x,y,z) < U 2 in £> 2 . 

Thus we have 

h Vol(Di) < M(/,£>i) < ui Vol (£> 1 ), 

^2 Vol (£> 2 ) < M(/,£> 2 ) < u 2 Vol(D 2 ). 


© Springer International Publishing AG 2017 
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Adding these inequalities we get that 

£\ Yol (Di) +£ 2 Vol (D 2 ) 

<M(f,D 1 ) + M(f,D 2 ) 

< U\ Vol(Z)i) + U2 \ol(D 2 ). 

The sum of the masses in D\ and D 2 is the mass in D , The upper bounds 

for the density on D\ and D 2 are less than or equal to u : 

u\ <u, u 2 < u. 

The lower bounds are greater than or equal to £\ 

£<£x, £<h- 

Putting this all together we get that the total mass M(/, D) satisfies 

£Vol (D) < £ x Vol (Dx ) +£ 2 Vol ( D 2 ) 

< M(f,D) 

< ux Vol(Z)i) + u 2 Vol (D 2 ) < u\ol(D). 

By subdividing D into n nonoverlapping subsets D\,... ,D n we can repeat this pro¬ 
cess and get a sequence of inequalities that get us closer to the value of the total 
mass in D , 

n n 

2 €j Vol (Dj) < M(f, D) < 2 uj Vol (Dj). 

j =1 J =1 

Population. In a similar manner let D be a set in the plane, say the map of a country. 
Suppose the population density at (x,y) is f(x,y ) [population/area] and that on D, 
the population density is between £ and u , 

£ < f(x,y) < u. 

If we know the area of D we can estimate the total population P(f, D) in D as 
/’Area (D) < P(f,D) < wArea(Z)). 

Using an approach like we did for mass, we refine the estimate by splitting D into 
nonoverlapping subsets D 1 and D 2 . Let £\ and £ 2 be lower bounds, and ux and u 2 
upper bounds, for the population density in D 1 and D 2 , respectively. Then 

£\ Axq2l(Dx) + £2 Area(T> 2 ) ^ P(/,T>) < ux Area (Dx) + u 2 Area (D 2 ). 

Splitting D into n nonoverlapping subsets we get that the total population in D sat¬ 
isfies 


y>AreaCD ; ) < P(f,D) < ^ w;Area(£»/). 
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These two examples raise two important questions. 

• How do we find the area of a subset of the plane or the volume of a subset in 
space? 

• What properties of / and D assure that the process of taking upper and lower 
estimates squeezes in on a single number? 

That number, if it exists, is what we call the integral of / over D. 

First we look at the question of area and volume. 

Area Our discussion of area is based on three basic properties of area in the plane. 
We shall explain later that what is said here about area applies, with appropriate 
changes, to volume in R 3 , as well as ^-dimensional volume in R n . 

(a) If two sets C and D in the plane have area, and C is contained in D , then the 
area of C is less than or equal to the area of D. 

(b) If two sets C and D in the plane have area and have only boundary points in 
common then the union of C and D has area that is the sum of the areas of C 
and D. 

(c) The area of a rectangle whose edges are parallel to the v and y axes is the product 
of the lengths of the edges of the rectangle. It doesn’t matter whether all, some, 
or none of the boundary points are included in the rectangle. 

To find out whether a subset D of the plane has area we introduce the notions of 
lower area and upper area of D. Take h > 0. Divide the whole plane into squares 
of edge length h by the lines x-kh and y - mh , where k and m are integers. The 
boundary of each /z-square is included in the h- square. 




Fig. 6.1 D is the quarter disk including its boundary. There are four 1-squares in D, and twenty-two 
^-squares filling more area. 


For each bounded set D, let Nl(D , h) be the number of /^-squares that are con¬ 
tained in D. Let Ai(D,h) be the area of all those squares, 

A L (D,h) = h 2 N L (D,h). 

The squares of edge length j h that are contained in D fill out all the squares of edge 
length h that are contained in D and possibly more. See Figure 6.1. Therefore their 
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total area is greater than or equal to the total area of squares of edge length h\ 


A L (D,\h)>A L (D,h). 


( 6 . 1 ) 


Setting h = ^ in inequality (6.1) shows that the sequence 

A L (D,±) = (±) 2 N L (D,j, r) n= 1,2,3,... (6.2) 

is an increasing sequence. Since D is bounded, the total area of /t-squares contained 
in D is less than the area of a large square containing the set D. Therefore (6.2) is 
a bounded sequence. We remind you of the Monotone Convergence Theorem, that 
states that a bounded increasing sequence has a limit. Call this limit the lower area 
of D and denote it as Al(D)\ 


A l (D)= lim A l (D,±). 

n—>oo ^ 



Fig. 6.2 D is the quarter disk including its boundary. Twenty 1 -squares intersect D, and fifty 
^-squares intersect D, covering a smaller area than the 1-squares. 


We next define the upper area. Denote by Nu(D,h ) the number of all /^-squares 
that intersect D. Denote by Au(D,h ) the area of the union of these squares: 

A u (D,h) = h 2 N u (D,h). 

If a square of edge length \h intersects D, so does the square of edge length h that 
contains it. Therefore 

A V (D, ±h)<Au(D,h). (6.3) 

This shows that the sequence 

A u (D,±) = (±) 2 N u (D,±) n = 1,2,3,... 

is decreasing. Again by the Monotone Convergence Theorem a decreasing sequence 
of nonnegative numbers has a limit; call this limit the upper area of D , and denote 
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it as Au(D): 

A V (D) = lim Au(D, X). (6.4) 

n^oo ^ 

Since fewer /z-squares are contained in D than /z-squares that intersect D, A^(D, h) 
is less than or equal to Au(D,h). See Figures 6.1 and 6.2. It follows that so are their 
limits: 

A l (D) < AuiP). (6.5) 

In words: the lower area is less than or equal to the upper area. We define now the 
concept of area. 


Definition 6.1. If the lower area of a set D is equal to its upper area, this 
common value is called the Area(D), and we say that D has area. 


Next we verify that area so defined has the three properties we listed at the start 
of this section. 

(a) If two sets C and D in the plane have area, and C is contained in D, then the 
area of C is less than, or equal, to the area of D. 

Since C is contained in D, for each h , A^(C,/z) < Al(D,K). So the lower area 
Al(C) < Al(D). Since C and D have area, 

A l (C) = Area(C), A L (D) = Area(D), 

so Area(C) < Area(D). 

(b) If two sets C and D in the plane have area and have only boundary points in 
common then the union of C and D has area that is the sum of the areas of C 
and D. 

Since C and D have only boundary points in common every h- square in C or 
D will be an h- square in the union CUD. However, CUD may contain more 
/z-squares if C and D share part of their boundaries. So 

A l (C, h) + A l (D, h) <A l (C U D, h). 

Similarly the number of /z-squares that intersect C plus the number that intersect 
D is greater than or equal to the number of /z-squares that intersect CUD since 
some /z-squares intersect both. Therefore 

Ajj{C UD,/z) < Ajj(C , h) + Ajj(D, h). 

Let h-^n tend to zero in the inequalities 

A l (C, h) + A l (D, h) < A l (C UD,h)<Au(CuD,h)< Au(C , h) + A V (D, h). 

The right side tends to Area(C) + Area (D), as does the left. Therefore the two 
center terms tend to a common limit, that is by definition Area(C U D). 
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(c) The area of a rectangle whose edges are parallel to the x and y axes is the 
product of the lengths of the edges of the rectangle. It doesn’t matter if all, 
some, or none of the boundary points are included in the rectangle. 

Let R be the rectangle [a,b] x [c,d\. The number of /z-squares that meet the 
boundary of R does not exceed twice the perimeter divided by h (when h is 
smaller than the sides of R). Therefore 


0 <(b- a)(d — c)— Al(R , h) < 


4((b-a) + (d-c)) 2 

- 1 - h 

h 


and 


0 < Ajj{R , h)-(b- a)(d - c) < 


A({b-a) + {d-c)) 2 
- H - '' 


Letting h tend to zero we see that the lower and upper areas of R are both equal 
to (b - a)(d - c). A similar argument works whether the rectangle R contains 
some or none of its boundary. 


Smoothly bounded sets. We show now that all sets that we intuitively think of 
having area—smoothly bounded geometric figures—have area in this sense. 


Definition 6.2. A smoothly bounded set D in the plane is a closed bounded 
set whose boundary is the union of a finite number of curves each of which is 
the graph of a continuously differentiable function, either 

y = f{x ), v in some closed interval, 


or 


v - /(y), y in some closed interval. 



Fig. 6.3 The smoothly bounded set D in Example 6.1. 


Example 6.1. The set D shown in Figure 6.3 is a smoothly bounded set. The 
boundary, included in D , is the union of three curves that are graphs of con¬ 
tinuously differentiable functions: 


x = fi(y) = y 2 (0 <y < 1), 
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x = f 2 (y) = 2-y 2 (0 < y < 1), 

and 

y = h(x) = 0 (0 < x < 2). 


□ 

Example 6.2. The boundary of the set D shown in Figure 6.4 is the union 
of curves that are graphs of continuously differentiable functions. D is a 
smoothly bounded set. □ 



Fig. 6.4 The smoothly bounded set D in Example 6.2. 


Theorem 6.1. If D is a smoothly bounded set then its upper and lower areas 
are equal, so D has area. 


The key result needed to prove Theorem 6.1 is the following result. 

Theorem 6.2. Let C be the boundary of a smoothly bounded set D in R 2 . 
Denote by C(h) the number of squares of edge length h that intersect C. Then 

C(h ) < C - (6.6) 

h 

where c is some number that depends on C. 


Proof. D is smoothly bounded so its boundary, C, is the union of a finite number 
of graphs of continuously differentiable functions. To prove the theorem we prove 
inequality (6.6) for the graph of a single continuously differentiable function 

y = g(v) a<x<b 

and add the inequalities. Denote by m an upper bound for the derivative of g on 
[a,b\, 


\g\x)\<m. 
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The number of //-squares that intersect the graph of g in the strip 

nh < x < (n + 1 )h 


is at most m + 2. Therefore the total number of //-squares that intersect the graph of 
g on the interval [a, b] is at most m + 2 times the number of //-intervals in [a, b]. The 
number of //-intervals in [a,b\ is less than or equal to See Figure 6.5. Therefore 


C(/z) < (m + 2) 


b - a 
h 


(m + 2 ){b - a) 
h 


This proves that we can take c = (m + 2)(b - a) in inequality (6.6) for the part of the 
boundary that consists of just one smooth graph. If the boundary of D consists of a 
finite number of smooth graphs, the estimate (6.6) follows by adding the coefficients 
c for each part. This completes the proof of Theorem 6.2. □ 



Fig. 6.5 On [a,b ] the graph of y = 3x is covered by no more than (3 + 2)(^jf L ) h- squares. 

We now show that the upper and lower area of a smoothly bounded set are equal. 

Proof, (of Theorem 6.1) Denote by C(h) the number of //-squares that intersect the 
boundary of D. Then 

C(h)>Nu(D,h)-N L (D,h). 

It follows from Theorem 6.2 that 

0 < h 2 Njj(D , h) - h 2 N L (D , h) < h 2 C(h) < ch (6.7) 

for some number c. Let h tend to zero through the sequence h = We saw before 
that the sequences h 2 Nu(D,h ) and h 2 NL(D,h ) tend to the upper and lower area of 
D. It follows from inequality (6.7) that the limits 

A V (D)= lim h 2 Nu(D, h) = lim h 2 N L (D, h) = A L (D) (6.8) 

/ z —>0 / z —>0 


exist and are equal. This common limit was defined as the area. Therefore the area 
exists. □ 
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Remark. In the special case where B is the boundary of a smoothly bounded 
set, A L (B,h ) = 0 for all h and A L (B) = 0. By equation (6.8), Ajj(B) = A L (B) and 
the area of B is zero. By properties (a) and (b) of area we see that the area of a 
smoothly bounded set D with boundary B is the same whether any boundary points 
are included in D or not. 

Volume Divide 3-space into cubes of edge length h by the planes x-kh , y = mh , 
z = nh , where k , m, and n are integers. See Figure 6.6. 



Fig. 6.6 An h -cube in 3-space, bounded by planes. A plane y-mh is indicated. 


We define in an entirely analogous fashion to area, using /z-cubes in the place of 
/z-squares, the notion of lower volume Vl(D) and upper volume VuiP ) of a bounded 
set D in 3-space. 


Definition 6.3. If the lower and upper volumes of D are equal, this common 
value is called the volume of D , Vol(D), and we say that D has volume. 


Next we define a class of sets in space that we will show have volume. 


Definition 6.4. A smoothly bounded set in R 3 is a closed bounded set whose 
boundary is the union of a finite number of graphs of continuously differen¬ 
tiable functions 

z = f(x,y), or y = f(x,z ), or x = f(y,z ), 
defined on smoothly bounded sets in the coordinate planes. 


A solid spherical ball and a solid cube are examples of smoothly bounded sets in 

R 3 . 

The next result is an extension to three dimensions of Theorem 6.2. 
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Theorem 6.3. Let S be the boundary of a smoothly bounded set in R 3 and 
denote by S ( h ) the number ofh-cubes that intersect S. Then 

where s is a number that depends on S . 



Fig. 6.7 The number of h -cubes in a stack that may intersect the graph of / is determined by the 
derivative V/. 


See Figure 6.7. The proof of this theorem is modeled on the proof of Theo¬ 
rem 6.2. 

Proof Since S is the union of the graphs of a finite number of continuously dif¬ 
ferentiable functions, it suffices to prove the inequality for the graph of a single 
differentiable function, say z = f(x,y), defined in a smoothly bounded set D in the 
x,y plane. Consider an /z-square contained D. For points Pi and P 2 in this /z-square 
there is a point P so that 

|/(Pi)-/(P 2 )| = |V/(P).(Pi -P 2 )| < max ||V/(Q)||||P 1 -P 2 || 

Q in D 



This bound helps us see that the number of /z-cubes that intersect the graph of / over 
a single h -square that intersects D is less than 2m, where m is an upper bound for the 
magnitudes of the first partial derivatives of / on D. Since the number of h -squares 
that intersect D is less than , where a is the area of a rectangle containing D, the 
number of /z-cubes that intersect the graph of / is less than ^, as claimed. □ 

It follows from Theorem 6.3 that the total volume of the /z-cubes that intersect S 
is less than ^-/z 3 = sh , which tends to zero as h tends to zero. 










6.1 Introduction to area, volume, and integral 


215 


It also follows from Theorem 6.3 that the lower and upper volume of a smoothly 
bounded set D in 3-space are equal, so we can state the following theorem. 

Theorem 6.4. Smoothly bounded sets in R 3 have volume. 


Volume in R n . Volume in ^-dimensional space can be defined entirely analo¬ 
gously, using ^-dimensional boxes. An ^-dimensional h- box consists of all points 
X = (xi,X 2 ,...x n ) for which 

njh < xj < (nj + l)h, (j=l,...,n). (6.9) 

where the nj are integers. The zi-dimensional volume of the h- box is defined to be 
h n . 

The concepts of a smoothly bounded set in ^-dimensional space R n and its vol¬ 
ume are a direct generalization of the three-dimensional case. The statement and 
proof of Theorems 6.3 and 6.4 can be readily extended to n dimensions. In Prob¬ 
lem 6.6 we ask you to determine the volume of several h -boxes. 

We have shown that smoothly bounded regions in R 2 have area and similarly 
smoothly bounded regions in R 3 have volume. Next we look at two examples of 
bounded sets, one in the plane whose area is not defined and the other in space 
whose volume is not defined. 

Example 6.3. Define D to be the set of points (v,y) in the unit square 0 < v < 1, 
0 < y < 1 where both v and y are rational numbers. The h- squares of the upper 
area will have total area greater than 1. As h tends to zero we see Au(D) = 1. 
Since every /z-square no matter how small contains points that have rational 
and irrational coordinates, the interior of D is empty and we get Al(D ) = 0. 
Even though D is contained in a square with area we cannot say what the area 
of D is, since Al(D ) 4 Au(D). See Figure 6.8. □ 


y 



Fig. 6.8 A set in R 2 with undefined area in Example 6.3. 


Example 6.4. Define D to be the set of points (v,y,z) in the unit cube 0 < v < 1, 
0 < y < 1, 0 < z < 1 where v is irrational. That is, remove from the solid cube 
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Fig. 6.9 A set in R 3 with undefined volume in Example 6.4. 


all points that lie on planes satisfying x-r rational. The /z-cubes that intersect 
D have a total volume greater than 1 and as h tends to zero, we get VuiP ) = 1. 
Since D has no interior points, Vl(D ) = 0. So Vl(D ) A Vjj{D). D does not 
have volume. See Figure 6.9. □ 

Examples and properties of multiple integrals Before making our formal defini¬ 
tion of the integral we give you two examples of the integral of a function / over a 
set D , and investigate some of the properties common to both examples. 

Example 6.5. Let D be a smoothly bounded set in R 2 , and z = f(x,y) > 0 the 
height of the graph of / above D. The volume V(J,D) of the region R in R 3 
defined by the inequality 

0 <z<f(x,y), (x,y) in D, 

is an example of the integral of / over D with respect to area, 

V(f,D)= f f(x,y)dA. 

Jd 

See Figure 6.10. Another notation for this integral is 

V(f,D)= f f(x,y)dxdy. 

Jd 

The significance of the dxdy notation will be explained in Section 6.3. □ 

Example 6.6. Let D be a smoothly bounded set in R 3 . Denote by f(x,y,z ) the 
density at the point (x,y,z) of D of some material distributed in D. The total 
mass M(/, D) contained in D is an example of the integral of / over D with 
respect to volume, denoted as 

M(f,D)= f f(x,y,z)dV. 

Jd 
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Fig. 6.10 The region R lies under the graph of / and above D. 


Again as we will see in Section 6.3 another notation for this integral is 
M(f,D) = f f(x,y,z)dxdydz. 

JD 

□ 

We use these two examples to illustrate key properties that we will use to define 
the integral, namely dependence of the integral on the function / and on the set D. 



Fig. 6.11 Adding the heights of thin columns. 


(a) In Example 6.5 consider three regions in space with the same base D and with 
heights /, g , and f + g. See Figure 6.11. The volume of a thin column with 
height (/ + g)(v,y), where (x,y) is in the small base B of the column, is the sum 
of the volumes of two thin columns with base B , one with height f(x,y ) and the 
other with height g(v,y). Therefore the volume of the region defined by 

0 < z < f(x,y) + g(x,y), (x,y)inD 


is the sum 


V(f + g,D) = V(f,D) + V(g,D). 
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Denoting the volume as the integral of height with respect to area we write this 
as 


f (f + g)dA= f f dA+ f gdA. 
Jd Jd Jd 


In Example 6.6 take the case that there are two materials distributed in D, with 
densities / and g. Their total density is / + g, and the total mass in the region is 
the sum of the total mass of each material: 


M(f + g,D) = M(f,D) + M(g 9 D). 


Denoting the total mass as the integral of density with respect to volume we 
write this as 


f (f + g)dV= f fdV+ f gdV. 
Jd Jd Jd 


(b) In Example 6.5 change the height / by a factor c. Volume is changed by the 
same factor: 

V(cf,D) = cV(f,D). 


Denoting volume as an integral we write this as 


f cfdA = c f 
Jd Jd 


fdA. 


In Example 6.6 introduce a different unit for mass. Then both density and the 
total mass are changed by the same factor: 


M(cf,D) = cM(f,D). 


Denoting mass as the integral of density / we write this as 




Properties (a) and (b) can be summarized by saying that the integral of / over 
D depends linearly on /. 




Fig. 6.12 Left: The region R under the graph of /. Center: The cylinder of height i is contained in 
R. Right: The cylinder of height u contains R. 














































6.1 Introduction to area, volume, and integral 


219 


(c) In Example 6.5 let £ and u be lower and upper bounds, £ < f(x.y) < u, for the 
height at all (x,y) in D. Then the volume over D lies between the bounds 

/’Area (D) < V(f,D) < wArea(D). 


See Figure 6.12. We can write 

£ Area ( D ) < I fdA<u Area (D). 

Jd 

Similarly in Example 6.6 if the density at all points of D lies between two 
bounds € < f(x,y,z ) < u, then the total mass contained in D lies between the 
bounds 

/’Vol (D) < M(f,D) < u\o\(D). 

This is called the lower and upper bound property. We can write this as 

€No\(D) < f fdV< uYol(D). 

Jd 

(d) In Example 6.5 if D is the union of two disjoint sets C and E , the total volume 
is the sum of the volume over C and the volume over E , V(f,CUE) = V(f, C) + 
V(J,E). We can write it as 


r f dA= r fdA + r 

JCUE Jc JE 


C and E disjoint interiors. 


Similarly in Example 6.6 the total mass contained in the union of two disjoint 
sets C and E is the sum of the masses contained in each set, Af(/, C U E) = 
M(j,C) + M(j,E). See Figure 6.13. This property is called the additivity prop¬ 
erty. We can write it as 


f fdV= f fdV+ f 

JCUE Jc Je 


fdV. 



Fig. 6.13 Additivity of mass when objects are joined at their boundaries. 
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Problems 

6 . 1 . Let D be the set of points (x,y) in R 2 such that 0 < ^ < 1 -x 2 . 

(a) How many \-squares are in D1 

(b) Does the |-square \\, |] x [|, |] intersect D1 

6 . 2 . Let D be the closed quarter disk in the first quadrant, with the disk center at the 
origin and radius 3. See Figure 6.1. 

(a) Find Au(D, ^). 

(b) Find A l (D,\). 

6 . 3 . Make a sketch of the rectangle D given by 0 < v < n, 0 < y < n. Find 

(a) the number Nl(D , of \-squares contained in D, and their total area; 

(b) the number N\j{D , \) of ^-squares that intersect D , and their total area; 

(c) the number C(^) of ^-squares that intersect the boundary of D , and their total 
area; 

(d) Verify that C{\) > N V (D , \)-N L (D , \) 

6 . 4 . Show that the ball |X| < 1 in R 3 is a smoothly bounded set. 



Fig. 6.14 Sets D and E in Problem 6.5. 


6 . 5 . The intersection of two smoothly bounded sets D and E in R 3 is along a 
smoothly bounded common surface in Figure 6.14. Show that 

vol (DUE) = vol D + vol E. 

6 . 6 . Find the ^-dimensional volume of each set. 

(a) A union of 57 h -boxes in R n , 

(b) The box in R 3 where 0 < xj < 10, j = 1,2,3 ,n = 3, 

(c) The box in R 6 where 0 < xj < 10, j = 1,2,3,4,5,6, n = 6. 

(d) The box in R 3 where 0 < xj < jq, j = 1,2,3, n = 3, 

(e) The box in R 6 where 0 < xj < jq, j = 1,2,3,4,5,6, n = 6. 


6 . 7 . A plate shown in Figure 6.15 has density f(x,y). 
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Fig. 6.15 A plate with variable density in Problem 6.7. Three subdivisions are shown. 


(a) If 2 < f(x,y ) < 7 in D show that 

630 < mass(D) < 2205. 

(b) If 2 < f(x,y ) < 4 in D\, and 4 < f(x,y ) < 7 in use additivity to show that 
990 < mass (D) < 1800. 

(c) Further information becomes available, that 2 < f(x,y ) < 4 in D 3 , / = 2 in £> 4 , 
4 < f(x,y ) < 6 in D 5 , and 6 < f(x,y ) < 7 in £> 6 - Use the additive property and the 
lower and upper bound properties to further narrow the mass estimate to 

1230 < mass (D) < 1686. 



Fig. 6.16 A map of population density, for Problem 6.8. 



0-10 

10-20 

20-40 

40-80 

80-160 

160-320 


6.8. A region is subdivided into counties, with the range of population density 
(people/square mile) / indicated on the map in Figure 6.16. The map key gives 
lower and upper bounds in each region: 
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P: 0 < f(x,y) < 10 
Q : 20 < f(x,y) < 40 
R : 10 < f(x,y) < 20 

5: 80 < f(x,y) <160 
T : 160 < f(x,y) < 320 
U: 40 < f(x,y) < 80 


(a) 

(b) 

(c) 

(d) 


The area of county Q is 250 square miles. Give bounds for the population of Q. 
The Western region consists of counties P, R, and U, that have areas 80, 100, 120 
square miles, respectively. Give bounds for the population of the Western region. 


Counties S and T have about the same area. Is it possible for 


I 


f(x,y)dA to be 


larger than 


f/ ,x ■ 


y)dA? 


Show that -4600 


< f fdA- f 
Jo Ju 


fdA< 5200. 



Fig. 6.17 The liquid volumes in Problem 6.9. 


6 . 9 . A liquid of variable density p fills the region D in the large beaker in Fig¬ 
ure 6.17, and another liquid of constant density S = 200 [kg/m 3 ] fills region C. Each 
of C and D has volume .05 [m 3 ]. The total mass of liquids is 

I ddV+ I pdV = 30. 

Jc Jd 

(a) Find the value of each integral. 

(b) Show that the minimum and maximum of p must satisfy 

Pmin — 400 < Pmax- 

6 . 10 . Let D denote the closed unit disk x 2 +y 2 < 1 in the plane and let U denote the 
open disk x 2 + y 2 < 1 . 

(a) Describe the boundary of D. 

(b) Describe the boundary of U. 

(c) What is Area(D)? Area(t/)? 
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6.2 The integral of a continuous function of two variables 

We focus in this section on the definition of the integral in the case where / is a 
continuous function on a smoothly bounded set D in the plane. 

We shall first define the upper and lower integrals, Iu(f,D) and II (/,/)), for con¬ 
tinuous functions / that are nonnegative and show that they are equal; their common 
value is defined as the integral of / over D and denoted J(/, D). We then verify that 
/(/,£>) has the four basic properties of the integral introduced in Section 6.1. 

(a) J(/ + g,D) = /(/,D) + /(g,D) 

(b) I(cf , D) = cl (/, D) for every number c 

(c) If £ < f(x,y ) < u for all (x,y) in D , then 

^Area(D) < /(/,£>) < wArea(D). 

(d) If C and D are disjoint or have only boundary points in common, 

J(/,CUD) = /(/,C) + /(/,D). 

We then extend the definition of the integral to functions that may have negative 
values, and verify the four properties listed above. 

In Theorem 6.10 we will show that all properties of the integral follow from these 
four properties. 

Upper and lower integrals of a continuous function / over D. According to the 
Extreme Value Theorem 2.11, / is bounded on D. Divide the plane into squares of 
side h as described in Section 6.1, and take all /z-squares that intersect D. In each of 
these h- squares Bj we denote by uj and €j an upper and a lower bound for /: 

0 < £j < f(x,y) < uj < M, (x,y) in Bj. (6.10) 

Call the sum 

h 2 uj (6.11) 

j 

over all /z-squares that intersect D an upper h-sum. Call the sum 

Y/tj ( 6 - 12 ) 

j 

over all /z-squares that are contained in D a lower h-sum. 

We claim: For a nonnegative function / every upper h-sum is greater than or 
equal to every lower h- sum. 

To see this we observe that for an h- square that is contained in D, the term h 2 uj 
in ( 6 . 11 ) is greater than or equal to the corresponding term h 2 £j in ( 6 . 12 ), since 
an upper bound uj is greater than or equal to the lower bound lj. The terms in the 
sum (6.11) corresponding to /z-squares that intersect D but are not contained in D 
do not appear in (6.12). Since the function / is nonnegative, so are the upper bounds 
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uj and the terms ujh 2 . It follows that the upper sum is greater than or equal to the 
lower sum for the same value of h. 

Next we show: For a nonnegative function /, given an upper h- sum, there is an 
upper /z/2-sum that is less than or equal to the upper /z-sum. 

To prove this we observe that if an /z/2-square intersects D , so does the /z-square 
that contains it. An upper bound uj for / in the h- square is an upper bound for / 
in each /z/2-square contained in it. If all four /z/2-squares contained in the /z-square 
intersect D , their contribution to the h/2 upper sum (using the same uj) equals the 
contribution of the /z-square to the upper /z-sum. If some of the /z/2-squares don’t 
intersect D , their contribution to the sum of /z/2-terms using the same Uj is less than 
or equal to the contribution of the /z-square to the h- upper sum. This shows that the 
upper h/2 -sum can be chosen to be less than or equal to the upper h- sum. 

Since / > 0 all upper /z-sums are nonnegative, i.e., zero is a lower bound for 
the set of upper /z-sums. The Greatest Lower Bound Theorem states that if a set of 
numbers has a lower bound then the greatest lower bound of that set exists. Call 
it !/(/,£>,/z). We have shown that for / > 0 every upper /z-sum has a correspond¬ 
ing upper /z/2-sum that is less than or equal to the upper /z-sum; it follows that 
U(J,D,h/2) is less than or equal to U(f,D,h). This shows that 

U{f,D,h), h=±, n = 1,2,... (6.13) 

is a decreasing sequence of numbers that are greater than or equal to zero. Therefore 
by the Monotone Convergence Theorem it has a limit as n tends to infinity. This limit 
is called the upper integral of /. We denote it as 

W,D)= lim U(f,D, 2~ n ). 

n—>oo 

There is an analogous result for lower sums: For / > 0, given a lower /z-sum, 
there is a lower /z/2-sum that is greater than or equal to the lower /z-sum. 

To prove this we note that the union of all /z/2-squares contained in D includes 
all h- squares contained in D. A lower bound lj for / in an /z-square is a lower bound 
for f in all /z/2-squares contained in it. An /z/2-square contained in D but that is not 
part of an /z-square that is contained in D gives a nonnegative contribution to the 
/z/2-sum, provided we choose a nonnegative lower bound for /. The resulting lower 
/z/2-sum is greater than or equal to the corresponding lower /z-sum. 

Each lower /z-sum is less than uA, where u is an upper bound for f on D and 
A is the area of a square that contains D. The Least Upper Bound Theorem states 
that if a set of numbers has an upper bound then it has a least upper bound. Denote 
by L(/,D,/z) the least upper bound of the lower /z-sums. We showed that for f > 0, 
for each lower h- sum there is a greater or equal lower /z/2-sum, so it follows that 
L(J,D,h/2) is greater than or equal to L(/,D,/z). Therefore 

L(f,D,h), h=jn, n= 1,2,... (6.14) 

is an increasing sequence less than or equal uA. By the Monotone Convergence 
Theorem this bounded, increasing sequence has a limit, called the lower integral of 
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/ over D\ we denote it as 


h(f,D)= lim L(f,D,2~ n ). 

n—>oo 


Theorem 6.5. The upper and lower integrals of a continuous nonnegative 
function f over a smoothly hounded set D in R 2 are equal, 

I L (f,D) = Iu(f,D). 


Proof To show that the upper integral and lower integral of a continuous function 
are equal we show that for every positive tolerance e , no matter how small, the upper 
h- sum and lower /z-sum differ by less than e when h is sufficiently small. 

We have shown that for / > 0 every lower /z-sum is less than or at most equal to 
every upper /z-sum. 

Differences due to the boundary. The difference between the number of terms in 
an upper h- sum ( 6 . 11 ) and the number of terms in a lower h- sum ( 6 . 12 ) is less than 
or equal to the number C(/z) of h- squares that intersect the boundary of D. We have 
shown in Theorem 6.2 that C(/z) is less than or equal to c/h for some constant c 
determined by the boundary of D. Therefore the total contribution of these terms to 
the upper /z-sum is less than or equal to 

Mh 2 C(h ) < Mh 2 - = Mch, 
h 

where M is an upper bound for the uj in that set. This tends to zero as h tends to 
zero. This shows that the sum of the terms in the upper sum that are not in the lower 
sum tends to zero as h tends to zero, so is less than | if h is sufficiently small. 

Differences due to h-squares in D. We show that the difference of the sum of the 
terms that are in both the upper sum and the lower sum tends to zero as h tends to 
zero. The difference of the upper and lower sums over h- squares in D is 

Yftiuj-tj). (6.15) 

j 

The function / is continuous on a closed bounded set D\ therefore it is uniformly 
continuous: for every tolerance t, we can choose a precision p so small that if the 
distance between two points in D is less than p , the values of the function / at these 
two points differ less than t. Take h so small that the distance between two points in 
an /z-square is less than p. If Ui is the maximum of / and li is the minimum of / on 
the /z-square then 

Using this estimate for the terms in the sum (6.15) shows that the difference between 
the sums over interior squares is less than Nh 2 t , where N is the number of terms in 
the sum. N is the number of /z-squares contained in D , which is less than A/h 2 , where 
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A is the area of a rectangle containing D. This shows that the difference (6.15) is less 
than At. Given the tolerance 6, take t = then the difference (6.15) is less than |. 

Adding our two estimates then we have shown that there are upper and lower h- 
sums with difference less than e if h is sufficiently small. This shows that the upper 
and lower integrals are equal, and thereby completes the proof of Theorem 6.5. □ 

Upper and lower integrals of bounded functions. Although we focused on the 
basic case where / is continuous and D includes its boundary, the definitions we 
made for %(/, D) and II(f,D) apply to bounded functions /, not necessarily con¬ 
tinuous, on a smoothly bounded set D. In that case the lower and upper integrals 
II(f,D) and Iu(f,D) both exist but they might not be equal. 


Definition 6.5. Let / be a bounded function on a smoothly bounded set D. If 
/{/(/, D) = /^ (/,£>) we say that / is integrable on D and that the integral of / 
over D exists. We write the integral as 

I L (f,D) = Iu(f,D)= f fdA. 

Jd 

The function / is called the integrand. For Cartesian coordinates x,y in the 
plane the integral of / over D is also denoted 

f f(x,y)dxdy. 

Jd 

and is called a double integral. 


Remark. By essentially the same argument as for Theorem 6.5 it can be shown 
that a bounded function that is continuous on the interior of a smoothly bounded set 
is integrable, i.e., that A(/,D) = 

Example 6.7. Let D denote the rectangle [0,1) x (0,2), and 

f(x,y)=y sin—L 
l-x z 

Then 0 < f(x,y) <y <2 and / is continuous in the interior of D. Therefore by 
the remark above, the integral 


f ysin—!_dA 

Jd 1-x 


exists. □ 

Approximate integrals. Next we show that the integral of / over D can be estimated 
with arbitrary accuracy without calculating upper or lower sums. 





6.2 The integral of a continuous function of two variables 


227 


Definition 6.6. Let / be defined on a smoothly bounded set D in the plane. 
Divide the plane as before into h- squares and denote by Cj = (xj,yj) some 
point of the j- th /z-square contained in D. Define an approximate integral , or 
Riemann sum , 

S (f,D,h) = f^f(Cj)h 2 , (6.16) 

j 

where the sum is taken over all h- squares contained in D. 


Theorem 6.6. Let f >0 be an integrable function on a smoothly bounded set 
D. As h- 2~ n tends to zero, every sequence of approximate integrals 

S(f,D,h) = Y J f(C j )h 2 
j 


converges to 

f fdA, 
Jd 

independent of the choice of the points C j. 


Proof Since /(Cy) lies between every lower and upper bound for / on the j-th 
h- square, S (/, D , h) is greater than or equal to every lower /z-sum. It is also less than 
or equal to every upper /z-sum since the terms in the upper h- sum corresponding to 
/z-squares that are not contained in D are greater than or equal to zero, since f > 0. 
Therefore S (/, D , h) is contained between the least upper bound of all lower h- sums 
and the greatest lower bound of all upper h- sums: 

L(/, D, h) < S (/, D, h) < U(f,D,h). (6.17) 

Because / is integrable on D , L(/,D,/z) and U(f,D,h ) tend to the same limit, that 
we called the integral of / over D. It follows from inequality (6.17) that S ( f,D,h ) 
also tends to the same limit: 


lim S (/, D, h) 



as asserted. 


□ 


Example 6.8. Let / > 0 be a continuously differentiable function on a smoothly 
bounded set D in the plane, and let R be the region above D and under the 
graph of /. See Figure 6.18. R is a smoothly bounded set in R 3 . We show that 


I fdA is the volume of R. By Theorem 6.6 the Riemann sums 

JD 
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Fig. 6.18 The sets D and R in Example 6.8. 


converge to I fdA as h tends to zero. Every Riemann sum using h -squares 

Jd 

is greater than or equal to the h -cube lower volume of R , and is less than or 
equal to the upper h -cube volume of R , 

V L (R,h) < ^ Vu(R,h). 


Since R is smoothly bounded the upper and lower volumes have the same limit 
as h tends to zero, that is the limit of the Riemann sums as well. Therefore 



= Vol (R). 


□ 

Example 6.9. Let f(x,y ) = 1 on a smoothly bounded set D , and let R be the 
region under the graph, over D. By Example 6.8 


Vol (R) = f IdA. 
Jd 


From our definition of volume we see that 


Vol (R) = Area (D)(1). 


Therefore 


X 


IdA = Area(D). 


Example 6.10. Evaluate 


X 


ydA 


where D is the rectangle 0<v<5,0<y<3. The graph of f(x,y) = y over D 
is shown in Figure 6.19. By Example 6.8 the integral is the volume of the set 
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R of points (x,y,z) with 0 < z < f(x,y), 0 < x < 5, 0 < y < 3. 

ydA = Vol (R) = j(3)(3)(5) = 22.5. 


L 


We next show that our definition of I fdA for nonnegative continuous functions 

Jd 

over smoothly bounded sets satisfies the four properties we stated at the start of this 
section. 


Theorem 6.7. Let D denote a smoothly bounded set, / and g nonnegative 
continuous functions in D, and c a nonnegative number. Then 

(a) f (f + g)dA= f fdA+ f gdA 
Jd Jd Jd 


(b) 


I cfdA = c I 

Jd Jd 


fdA, 


(c) If l < f(x,y ) <u on D then ^Area (D) < ( fdA < u Area(Z)), 

Jd 

(d) IfC and D are smoothly bounded sets that are disjoint or have only bound¬ 
ary points in common, and if CUD issmoothly bounded, then 


f fdA = f fdA + f fdA. 

Jcud Jc Jd 


Proof, (a) It follows from the definition of approximate integrals that 
S(f + g,D,h) = S(f,D,h) + S(g,D,h) 

provided we use the same points C j in forming all three approximate integrals. 
Letting h tend to zero gives part (a). 

(b) Similarly, 


S (cf, D, h) = cS (/, D, h). 
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Letting h tend to zero gives part (b). 

(c) To get an upper bound for the integral of / over D we replace each /(Q) in 
the Riemann sum (6.16) by the upper bound u for the values of the function /. This 
gives an upper bound for the Riemann sum. The limit of this upper bound as h tends 
to zero is u Area(D); it is an upper bound for the integral, as asserted in part (c). By 

a similar argument ^Area(D) < /dA. 

Jd 

(d) If C and D are disjoint, then for h small enough an h- square may belong to 

C or D , but not both. In this case a Riemann sum for / over C U D is the sum of a 
Riemann sum for / over C and a Riemann sum over D. If C and D have a boundary 
curve in common, there are h- squares that belong to CUD but belong neither to C 
nor to D. These /z-squares intersect the common boundary of C and D. Since the 
boundaries of C and D are smooth curves, the number of h- squares they intersect is 
less than some constant multiple of 1 //z. Therefore their total area tends to zero as 
h tends to zero. Therefore the Riemann sum over these squares tends to zero; this 
proves part (d). □ 


Example 6.11. Let f(x 9 y ) = 4 + y and let D = [0,5] x [0,3] be the rectangular 
region 

0 < v < 5, 0 < y < 3. 


From 

know 


Example 6.10 we know that I ydA = 22.5 and from Example 6.9 we 

Jd 

that 4 ( 1 dA = 4Area(D) = 4(15). So by Theorem 6.7 
Jd 

f (4 + y)dA = 4 1 dA + C ydA = 4(15) + 22.5 = 82.5. 

Jd Jd Jd 


□ 

We now extend the definition of the integral to functions that may have negative 
values. 


Definition 6.7. Let / be a continuous function on a smoothly bounded set D 
in R 2 . Write / as the difference of two continuous functions g > 0 and k > 0: 

f = g~k. 


We define the integral of / over D as the difference of the integrals of g and k 
over D, 



There are many ways of writing / as such a difference. We show that all such 
differences give the same value for the integral. Take two decompositions of / 
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f = g~k, f = m — n. 


(6.18) 


where g , k , m, and n are nonnegative and continuous on D. We show that calculating 
the integral of / using either of these decompositions of / gives the same value, i.e., 
we want to show that 


f gdA- f kdA= f mdA - f ndA. 
Jd Jd Jd Jd 

To prove this relation we rewrite it as 

I gdA + I ndA = I mdA + I kdA. 
Jd Jd Jd Jd 


By part (a) of Theorem 6.7, since g + n > 0 and m + k > 0, we can rewrite the two 
sides as 

I (g + n) dA = I (m + /:) dA 
Jd Jd 

Since it follows from relations (6.18) that g + ft and m + k are equal, so are their 
integrals. This proves that the definition of the integral of / does not depend on how 
we write / as a difference of two nonnegative functions. 

If now f > 0, use the difference 


We get that 

f (~f)dA= [odA- f /dA. 

Jd Jd Jd 

Since I 0 dA = 0 this gives 
Jd 

f (-f)dA = - f /dA. 

Jd Jd 

Next we claim that Theorem 6.6 —asserting that the integral of / over D is the 
limit of the Riemann sums S (/, D , h) as h tends to zero—is valid for all continuous 
functions /, not just for nonnegative ones. To see this represent / as the difference 
g-k of two nonnegative functions g and k , and use the same points Q for each 
function. Then 

2/(C i)h 2 = S{f,D,h) 


= S(g-k,D,h) = Y J (g(Ci) - k(C,))h 2 


= Yj g(Ci)h 2 - Y *(C i)h 2 = s (g, D,h)-S (k, D, h). 


232 


6 Integration 


Then apply Theorem 6.6 to the functions g and k. As h tends to zero, the right side 
tends to 

f gdA- I IcdA, 

Jd Jd 

therefore so does the left side. Since g-k = /, this is by Definition 6.7 equal to 

f /dA. 

Jd 

The same reasoning, representing the integrand as the difference of nonnegative 
functions, shows that Theorem 6.7 on the linearity, boundedness, and additivity of 
the integral holds for continuous functions and numbers that may be negative. We 
formulate this as follows. 


Theorem 6.8. For continuous functions f and g on smoothly bounded sets C 
and D, and all numbers c, 

(a) f (f + g)dA= f /dA + f gdA, 

Jd Jd Jd 

(b) f cfdA = c f /dA, 

Jd Jd 

(c) If l < f(x,y ) < u then ^Area(D) < ( /dA < u Area(D), 

Jd 

(d) IfC and D are disjoint or have only boundary points in common and CUD 
is smoothly bounded then 



f /dA+ r 
Jc Jd 


/dA. 


Example 6.12. Evaluate 



where D is the closed unit disk centered at the origin. Observe that the inte¬ 
grand f(x,y) = x 3 y 2 has a symmetry: 


f(-x,y) = -x 3 y- = -f(x,y). 


Consider forming a Riemann sum, where for each point C j = (xj,yj) that we 
choose in an h- square with xj > 0 we choose the corresponding point ( ~xj,yj ) 
for the /z-square on the other side of the y-axis. Then the Riemann sum is 
exactly zero. Since this can be done for every h , and since the integral is the 
limit of Riemann sums, we get 


X 


x 3 y 2 dA =0. 


□ 

In Problem 6.23 we ask you to use the properties above to prove the next theorem. 
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Theorem 6.9. For continuous functions f and g defined on smoothly bounded 
sets C and D in R 2 , 

(a) If g(x,y) < f(x,y) for all (x,y) in D then 


f gdA< f 
Jd Jd 


fdA, 


(b) IfC is a subset ofD and f >0 on D then 


f fdA < f 
Jc Jd 


fdA. 


Example 6.13. Let / > 0 be a continuous population density function on D 
and denote by a > 0 a number in the range of /. Suppose / > a on a smoothly 
bounded set C c D. Using Theorem 6.9 we estimate the population of D as 


f fdA > f fdA > f 

Jd Jc Jc 


adA = a Area (C). 


See Figure 6.20. 


□ 



Fig. 6.20 / > 0 and / > a in C. So J* fdA> J* fdA > aArea(C). See Example 6.13. 


As we noted earlier in the chapter, the case of continuous functions on smoothly 
bounded sets D is basic, and we focused our attention on them. In fact analogues to 
Definition 6.7 and Theorems 6.7, 6.8, and 6.9 hold for bounded functions that are 
integrable on the interior of smoothly bounded sets. 

Next we show that if a number J?(f,D) is defined for all continuous functions / 
on smoothly bounded sets D , that satisfies the four properties listed in Theorem 6.8 
then J?(f,D) is the integral of / over D. 
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Theorem 6.10. Suppose is defined for every continuous function f 

on a smoothly bounded set D and that it has the following properties. 

(a) S(f + g,D) = S(f, D) + J?(g, D) 

(b) J?(cf,D) = cJ?(f,D)for every number c 

(c) If€< f(x,y) < ufor all (x,y) in D, then 

^Area(D) < (/,£>) < zzArea(D), 

(d) For all pairs of smoothly bounded sets C and D that are disjoint or have 
only boundary points in common, 

/(/,CuD) = /(/,C) + /(/,D), 

Then 

S(f,D)= f fdA. 

JD 


Proof. We shall deduce from these four properties that for f > 0, (/,£>) is less 

than or equal every upper sum and greater than or equal to every lower sum. To 
see this, take an upper /z-sum and lower /z-sum and partition D into subsets D, that 
are the intersections of D with the h- squares. Denote by and Ui lower and upper 
bounds for / on Df. By properties (c) and (d) we get 

2 e,h 2 < Tj £i Area (D t ) 

h -squares in D 

< J?(f, D) < y ui Area (D,) < u,h 2 , 

h -squares intersecting D 


therefore 

L(f,D,h) < < U(f,D,h). 

As h tends to zero we get 


I L (fiD) < S(f,D) < Iu(f,D). 

Since Iu(f, F>) = Il(J,D) = I fdA, their common value is J?(f,D). 

JD 

Now given a continuous function / not necessarily nonnegative, choose a positive 
function p so that / + p is positive as well. According to property (a) 


S(J + p 9 D) = S(J 9 D) + S{p,D). 


X 


(f + p)dA = S(f,D)+ pdA. 


L 


So 
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Subtract to get 

S(f,D)= f (/ + p)dA- f pdA= f fdA. 
Jd Jd Jd 


□ 


Problems 


6.11. Use an area or a volume interpretation to evaluate the integrals. 

(a) I 1 dA where D is the region 1 < x < 2, 0 < y < logx. 

Jd 

(b) I xdA where D is the rectangle 0 < x < 3, -l<y<l. 

Jd 

(c) J* yj l - x 2 - y 2 dA where U is the unit disk centered at the origin. 


(d) I 1 dV where H is the half ball x l + y l + z z < 1, z > 0. 

Jh 


6.12. Use a volume interpretation to evaluate I fdA for each function /. 

Jd 

(a) f(x,y ) = 3 and D is the disk of radius 5 centered at (0,0). 

(b) f(x,y ) = \y and D is the rectangle where -2 < v < 3 and 0 < y < 4. 

(c) f(x,y) = y/x 2 +y 2 and D is the unit disk centered at the origin. 


6.13. Sketch rectangles D = [- a , 0] x [0, b ] and E = [0,1] x [-c, c] in the plane, where 
a , b, c are positive. Determine without calculation which of these integrals are posi- 


tive. 


(a L 

f xdA 

'D 
r * 

(b L 

I xdA 

r* 

(C L 

I (l-v)dA 

(d L 

f/ M 


6.14. Let f(x,y ) be a continuous function on a smoothly bounded set D that is sym¬ 
metric about the origin, that is, D contains the negative of each of its points, and 
assume f(-x, -y) = - f(x,y ) for all points of D. 

(a) Find /(0,0) if (0,0) is in D. 

(b) Show that there are approximate integrals of / over D that are exactly zero. 

(c) Show that I fdA = 0. 

Jd 
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(d) Evaluate I xydA where D is the unit disk centered at the origin. 

Jd 

(e) Which of the following functions satisfy f(-x, -y ) = -f(x,y)l 

x , y 2 , xcos y, xy 2 , x-y. 

6.15. Use properties of the integral, symmetry, and a volume interpretation to eval¬ 
uate 

(y 3 + 3xy + 2)dA 




where D is the unit disk centered at the origin. 

6.16. Suppose / is a bounded function and D a smoothly bounded set in R 2 . 


(a) Show that if Area(ZJ) = 0 then the interior of D is empty. 

(b) Show that if the interior of D is empty then IlU\ D) = 0. 


(c) Conclude that if / is an integrable function and Area(Z)) = 0 then I fdA = 0. 

Jd 

6.17. Justify the following items which prove: 

If / is continuous on R 2 and I fdA = 0 for all smoothly bounded sets R , then / is 

Jr 

zero at all points of R 2 . 

(a) If f(a , b) = p> 0 then there is a disc D of radius r > 0 centered at ( a , b) in which 

f(x,y o > \p. 

(b) If / is continuous and f(x,y ) > p\ > 0 on a disk R then 


I 


fdA > pi(Area(7?)) > 0. 


(c) If / is continuous and I fdA = 0 for all smoothly bounded regions R , then / 

Jr 

cannot be positive at any point. 

(d) / is not negative at any point either. 


6.18. Write a Riemann sum for 


/ 

Jro, 


x 2 y 3 dA 


'[0,l]x[0,l] 

using points ( xj,yj ) at the upper right corner of each ^-square. 

6.19. The sum over integers i and j, 

Yj iHhf + ijhffh 2 

z' 2 + j 2 <10h~ 2 


approximates which integral, 
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f (v 2 +y 2 ) 2 dA, or f (x 2 +y 2 ) 2 

Jx 2 +y 2 < 10 2 J* 2 +v 2 <10 


dA? 


<10 z ^x z +;y z <l 

6.20. Let D be a rectangle [a, /?] x [c, J] and consider integrals of the form 


X 


f(x)g(y)dA 


where / and g are continuous functions of one variable. Use the notion of Riemann 
sum to prove that the integral of the product is the product of the integrals: 


f mg(y)dA= f f(x)dx f d g(y)dy. 
JD Ja Jc 


6.21. The integral 

J- f sinf--— )dA 

J[0,i]x[0,i] Hi - v 2 )(l -t 2 )^ 

exists because the integrand is bounded and is continuous in the interior of the 
square. 

(a) Find upper and lower bounds on the integrand. 

(b) Calculation using Riemann sums indicates that 


f 

J[0 


[0,.999]x[0,.999] 


sin(-2-—)dA = .423 


approximately. Assuming that is correct, find bounds on /. 


6.22. Let f(x,y) = (4- 
where r < 2. Is / bounded? integrable on D? 


y 2 ) 1/2 in the disk D of radius r centered at the origin, 


6.23. Prove Theorem 6.9 by justifying the following steps, 
(a) For part (a): 


(i) 

VI 

o 

-g(x,y). 

(ii) 

o< (/- 



JD 

r* 


(iii) 

0< (/- 

g)dA = 


JD 

r * 

r * 

(iv) 

gdA< 

JD 

/dA. 

JD 


= f m-f 


gdA. 


(b) For part (b): 

(i) Every /z-square in C is in D. 

(ii) For every Riemann sum for 
is equal or larger. 


X 


/dA there is a Riemann sum for I /dA that 


X- 
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(iii) Every Riemann sum for I fdA does not exceed fdA. 


'c Jd 


(iv) f fdA < f fdA. 


c Jd 


6.3 Double integrals as iterated single integrals 


Now that we have defined the integral 



for a continuous function / over a smoothly bounded set D in R 2 we show how to 
use definite integrals of a single variable to compute it. We show in this section that 
such integrals can be evaluated by performing integration with respect to x, followed 
by integration with respect to y. 


(x,d(x)) 



Fig. 6.21 Left: D is x simple. Right: Disy simple. 

We say that a bounded set D is x simple if it has the following property: The set 
of all points in D whose second coordinate is y is an interval D(y) parallel to the v 
axis, whose endpoints a(y) and b(y) are continuous functions of y: 



a(y) < x< b(y), c <y <d. 


See Figure 6.21. The integral of a continuous function f(x,y) with respect to v over 
the interval D(y ) 



is a continuous function of y. We integrate this function with respect to y between c 
and d, getting 
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We call this an iterated integral. 

We can make similar calculations for functions over y simple sets: The set of all 
points in D whose first coordinate is x is an interval D(x) that is parallel to the y axis 
and whose endpoints c(x) and d(x) are continuous functions of x: 

c(x) < y < d{x ), a<x<b. 

See Figure 6.21. We can then calculate an iterated integral 

J r b / r \ r b / ry= d ( x ) \ 

I f(x,y)dy\dx= I I f(x,y)dy dx. 
a \JD(x) / Ja \Jy=c(x) I 

Let’s look at some examples. 



Fig. 6.22 The set D in Example 6.14. 


Example 6.14. Let D be the region bounded by the graphs of y = sin x and y = 0 
for 0 < x < In. Find the iterated integral of the constant function f(x,y ) = 7 
over D. We sketch the region D in Figure 6.22. D is the union of two y simple 
sets. For 0 < x < n we have 0 < y < sinx, and for n < x < 2n we have sinx < 
y < 0. So we set up two iterated integrals 


r*x=7T ( r>y =sinx \ r>x=2n ( s*y =0 

L (JU 7 *r+L (X 


7 dy dx 


;y=sinx 


r'X—'Ji r^x-ln 0 

If r n X—JT r n X—z.71 

7sinxdx+ I -7sinxdx= -7cosx + 7cosx 

Jx=0 Jx=7T 1 Jv=0 1 ix=n 

= —7(—1 - 1) + 7(1 - (-1)) = 28. 


□ 

Example 6.15. Let D be the region bounded by the graphs of x = y 2 and x = y, 
and let /(x,y) = 2xy 3 . See Figure 6.23. Region D is both x simple and y simple. 
Every point (x,y) in D has o<y< l, and for each y between 0 and 1, D(y ) is 
the interval of points with x values between y 2 and y. The iterated integral 


X' (L v 4 * - f 1C v 4 = X' w 
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Fig. 6.23 The set D in Example 6.15. 


= £' CvV -,V)d y = £' <v J -y 7 )dy = [!/-!/]; = i - i = £. 

Viewing D as y simple we see that every point in D has 0 < v < 1. The set 
D(v) is between v and yfx. The iterated integral 



We can generalize the notion of an iterated integral over an x simple set to deal 
with more complicated sets. Denote by D(y) the set of all points of a set D whose 
second coordinate is y. For each value of y, suppose D(y) consists of a finite number 
of intervals parallel to the x axis whose endpoints are piecewise continuous func¬ 
tions of y. Since D is bounded, D(y) is empty for all y outside some interval (c,d). 
From single variable calculus we know that for such a set D , the integral of a contin¬ 
uous function f(x,y ) with respect to x over D(y) is a piecewise continuous function 
of y. Integrate this function with respect to y between c and d. Denote the resulting 
number as 


II(f,D) = 



f(x,y)dx\dy. 


We call this the iterated integral of / over D. We show now the following. 


Theorem 6.11. Let f be a continuous function on a smoothly bounded set D. 


Then the iterated integral II(f,D) is equal to the 



fdA. 


Proof To see this we show that the iterated integral has all four properties of the 
integral listed in Theorem 6.10. 

Property (a), 
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II{f + g,D) = II(f, D) + II(g, D), 

follows from properties of integrals for functions of a single variable, since for each 
y, the integral of / + g with respect to x over D(y ) is the sum of the integral of / with 
respect to v and the integral of g with respect to v over D(y). The integral of this 
sum with respect to y is the sum of the integrals with respect to y of the two terms. 
A similar argument shows property (b): 

II(cf,D) = cII(f,D). 


(c) To derive the lower and upper bound property of the iterated integral we apply 
first the lower and upper bound property of integrals with respect to v. Denote by 
L(y ) the sum of the lengths of the intervals constituting D(y). Suppose the values 
f(x,y ) are between £ and u. The integral of f(x,y ) with respect to v over D(y) lies 
between some bounds £L(y ) and uL(y ): 

£L(y) < f fix, y ) dx < uL(y) 

JD(y) 

for all y. It follows from properties of integrals of a function of a single variable that 


£ f L(y) dy < 



f(x,y) dx 



L(y)dy. 


From single variable calculus, or by an argument similar to that in Example 6.8, we 
know that the integral of L(y) with respect to y is the area of D. So the inequality 
above can be rewritten as 


^Area(D) < //(/,£>) < wArea(D), 


which is property (c) for iterated integrals. 

To show property (d), additivity with respect to the domain of integration, we 
note that if C and D are disjoint or have only boundary points in common, then C(y) 
and D(y) are disjoint or have only boundary points in common. Therefore for each y 


X 


/ N f(x,y) dx= f f(x,y) dx+ f f(x,y) dx. 

(CUD)(y) JC(y) JD(y) 


Integrating both sides with respect to y we get 
II(f,CUD 

/(x,y)dX) dy + 



C W(cuD)Cy) 


f(x,y)dx dy 


nr f(x,y)dx)dy+nf 

Jc \JC(y) / Jc \JD( 


f(x,y) dx dy = II(f, C) + II(f, D). 


Jc \JC(y ) / Jc \JD(y) 

This completes the proof of additivity for the iterated integral. 
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Now that we have shown that the iterated integral has all four properties of the 
integral listed at the beginning of Section 6.2, we appeal to Theorem 6.10, that these 
four properties imply //(/,£>) is equal to the integral of / over D. 


Reversing the roles of v and y we have the analogous result for iterated integrals, 


f (f' 

Ja \Jd(x ) 


f(x,y)dy\dx. 


In the special case where / is continuous on a rectangle D = [a, b] x [c, d\ then by 
Theorem 6.11 

r *y=d / px=b \ 

f(x,y)dx\dy, 


and 


Therefore 


//“- ar. 

rif'-i-nr 


f(x,y)dy)dx. 


f dx I dv. 


/ d\j civ 

Example 6.16. Let D be the rectangle given by 

2<v<3, 0 < y < 1. 


Then 

£ x y 2dA = f 2 (f o xy 2 dyjd X =£ [dxy 3 f y=0 dx=£ \xdx=\. 
Integrating first with respect to v and then with respect to y we get 




Fig. 6.24 The region in Example 6.17. 
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Example 6.17. We can compute the double integral of /(x,y) = xy 3 over the 
domain D shown in Figure 6.24. D is bounded by the lines y = 0, y = 1, and 
the graphs of the functions (x in terms of y) x = 1 and x = yjy. Then 

f *y 3 cL4= C{f xy 3 dx)dy = f i(y 3 -y 4 )dy = \{\ - ±) = 

Jd Jo \J-s/y ) Jo 

This double integral may also be evaluated by 

; y=* 2 
y=0 


xy dy dx 


/->■*' fir 

Jd Jo \Jy =o 

= f x(l(x 2 ) 4 )-i(0) 2 )dx= f \x 9 dx=^. 
Jo Jo 


Problems 


6 . 24 . Compute 


X 


f(x,y)dA 


where D is the set in the plane bounded by the graphs of y = 1, x = y, and x = 4 - y, 
and /(x,y) = e x+y . 


6.25. Evaluate 


X 


ydA 


where D is the half disk where x 2 +y 2 < 1 and y > 0. 

6 . 26 . Evaluate the integrals. 


(a) f (x z -/) dA where D=[-l,l]x [0,2], 
Jd 


J2 


(b) I x 2 /(x + y) dxdy where D = [0,1] x [0,1] 

Jd 

6 . 27 . Consider the integrals 

I" VydA, I" xdA, f (^y + x)dA, 

Jd Jd Jd 

where D is the triangular region in Figure 6.25. 

(a) List the three integrals in order from least to greatest, using inequalities among 
the functions y/y, x and yjy + x on D. 
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Fig. 6.25 The set D in Problem 6.27 . 

(b) Evaluate the integrals as iterated integrals. 



Fig. 6.26 County D in Problem 6.28. 


6 . 28 . The population density [people/area] of County D , Figure 6.26, where every¬ 
one wants to live near the Southwest corner, is 


p{x,y) = 


c 

(1 + 3 x + yf 


The total population is 10 5 and c is a constant. 

(a) Find the average population density. 

(b) Express the total population as an integral over D , and set it up as an iterated 
integral. 

(c) Find c. 


6 . 29 . Evaluate 


f s, 
Jr 


sinydA 


where R is the region 0<v<l, 0<y< -\fx. 
6 . 30 . Evaluate 


X 


e?xdA 


where R is the region bounded by y = x + 2 and y = x 2 . 
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6 . 31 . Supply the missing numbers. 


/ 


[0,2] x [— 1,(?)] 


(3xy 2 + 5x 4 y 3 )dA 


H?) /~l 

= (?) xdx J 


ydy + (?) 



6.4 Change of variables in a double integral 

A change of variables relates two pairs of variables (v,y) and ( u,v ) by a function F 
from R 2 toR 2 . 


Definition 6.8. A continuously differentiable function 
F (u,v) = (x(u,v),y(u,v)) 

from an open set U in the plane to R 2 is called a smooth change of variables 
if F is one to one and its derivative matrix is invertible at each point of U. 


If F is a smooth change of variables then the Jacobian 

JF(u,v) = det£>F(w,v) = det Xu Xv 

yu yv 


is nonzero and its absolute value can be interpreted as the local magnification factor 
at ( u , v) of area under the mapping. To see this take the triangle S whose vertices are 

( u,v ), (u + h,v ), ( u,v + h ). 


These points are mapped into the points 

(x(u, v),y(u , v)), ( x(u + h , v), y(u + h, v)), ( x(u , v + h),y(u , v + h)) 

in the x,y plane and are vertices of a triangle T. See Figure 6.27. We approximate 
(x(u + h,v),y(u + h,v)) by (x + hx u ,y + hy u ) and approximate (x(u,v + h),y(u,v + h)) by 
(x + hx v ,y + hy v ), where the functions x, y and their partial derivatives are evaluated 
at (u,v). Denote by T' the triangle with vertices 

(x,y), (x + hx u ,y + hy u ), (x + hx v ,y + hy v ). 


For h small the shaded area of F(S) in Figure 6.27 is close to the area of the 
triangular region T. The error in the linear approximation of sides of T by the sides 
of T' is less than a multiple of h 2 . Two of the side lengths of T' are \\(hx u ,hy u )\\ and 
\\(hx V9 hy v )\\ 9 multiples of h 9 so changing the length of a side of T' by a multiple of 
h 2 changes the area by less than kh 3 , for some number k. (See Problem 6.33.) So the 
area of T differs from the area of 7" by an amount that is less than kh 3 . Therefore 
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Fig. 6.27 Area(T) - Area(T') is less than a constant multiple of h 3 . 


Area(7)-Area(r') < kh 3 
Area (S) ~ l h 2 

and the limit of the quotient as h tends to zero is zero. Therefore as h tends to zero, 

Area (T ) Area (IT) ^ ave th e same limit. The area of the triangle T' whose 
Area (S) Area (S) 5 

vertices are given above is 

Area(r') = \\{hx u hy v -hx v hy u )\ = \\x u y v -x v y u \h 2 . 


The area of S is \ h 2 . The ratio of these two areas is 


\JF(u,v)\ = | x u y v -x v y u \, (6.19) 

the absolute value of the Jacobian at (w,v). If the mapping F preserves orientation. 
That is if it maps small positively oriented triangles in the u , v plane to positively 
oriented approximate triangles in the x,y plane then the ratio of the areas is simply 
the Jacobian J of the mapping. 

Here is the change of variables theorem for integrals. 

Theorem 6.12. Let F(u,v) = (x(u,v),y(u,v)) denote a smooth change of vari¬ 
ables that maps a smoothly bounded set C onto a smoothly bounded set D, 
so that the boundary of C is mapped to the boundary of D. Denote by f a 
continuous function on D. Then 

j f(x,y)dxdy= j f(x(u,v),y(u,v))\JF(u,v)\dudv (6.20) 

Jd Jc 

where /F is the Jacobian of the mapping. 


Proof. Divide the u , v plane into /z-squares and approximate 



, v),y(u , v))\J¥(u , v)| du dv 
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by a finite sum 

S(.f\CJi) = ^ f(F(u j,Vj))\JF(u j,Vj)\h 2 , 
j 

where ( uj , Vj) is some point in the j-th h- square, and the sum taken over all /t-squares 
contained in C. The mapping F carries the h- squares in the u, v plane into smoothly 
bounded sets in the x,y plane that we denote as Dj. The area of Dj is approximately 
\JF(uj,Vj)\h 2 , with an error less than some multiple of h 3 . 

Let D' be the union of the sets Dj. The integral of / over D' is the sum of the 
integrals of / over Dj\ 


l 


fdxdy. 


We approximate the y-th integral above as f(u,j, vj) Area (Dj), and we approximate 
the Area (Dj) by \JF(uj,Vj)\h 2 . The error thus committed in each term in the sum is 
less than ch 3 , for some constant c. The total error committed is less than or equal 
to the sum of the individual errors. Since each individual error is less than ch 3 , and 


since the number of terms in the sum is less 


Area(D) 


, the total error is less than 


Area< 0 V)=Area (Z»d,. 


This shows that 


h 2 


\S(f,C,h)- 


■L 


f(x,y)dA 


< Ch. 


So J" f(x,y)dA and J" f(x(u,v),y(u,v))\jF(u,v)\dudv differ less than some con¬ 
stant times h. For h small enough the area of the set of points that are in D but not 
in D' does not exceed Mch where M is an upper bound on |/F(w,v)| on the closure 
of C and c is a number that depends on the boundary of C. Hence the difference 

between I /(v,y)dA and I f(x,y)dA tends to zero as h tends to zero. □ 

Jd' Jd 


r\ 

V 

J>L 

y 

1 - 

X 

( 

Q 







Fig. 6.28 The smooth change of variables in Example 6.18 maps the circle to the ellipse. 


Example 6.18. Define sets 
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C = {(u, v) : u 2 + v 2 < r 2 }, £> = {(x,y) : + (|) 2 < r 2 }. 


The boundary of C is a circle and the boundary of D is an ellipse. See Fig¬ 
ure 6.28. The mapping F (u,v) = (< au,bv ), v = au , y = bv, a > 0, b > 0 sends 
points ( u , v) in the set C one to one onto the points of D. Find the area of D. 
We have 


JF(u,v) = detZ)F = det 


a 0 
Ob 


- ab> 0, 


so F is orientation preserving. By the change of variables theorem 


Area(D) = 


I ldxdy= I \{ab)dudv = abnr 2 . 
Jd Jc 


□ 




Fig. 6.29 The mapping F in Examples 6.19 and 6.20. 


Example 6.19. Let D be the shaded set on the right in Figure 6.29. Find 

I e^ 2+v2 dA. The iterated integrals we would obtain in x,y coordinates will 
JD 

not help us evaluate this integral because we cannot find a simple antideriva- 

2 2 

five for e x +y with respect to v or y. So we try a change of variables F that is 
suggested by the polar coordinate transformation 


x = u cosv, y-u sinv. 


Note that F is one to one on an open set that contains the rectangle C given by 
2<u<5, 0<v< 27r/3. The Jacobian is 


J¥(u,v) = detZ)F(w,v) = det 


cosv —u sinv 
sinv u cosv 


2 • 2 

= u cos v + wsin v = u. 


The Jacobian is positive in C. By the change of variables theorem, 
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f e* 2+y2 dxdy= f e w2cosWsin2v wdwdv= P ( P e'^i/di/) dv 

JD Jc Jv=0 \Ju=2 ) 

rv= 27r/3 r -,m=5 /-v=27t/3 

= | Ue M ] dv = | ^(e 25 -e 4 )dv = ^(e 25 -e 4 )y. 

J y-0 L Jm=2 J y-Q 


Remark: It is customary to write (r, 0) for polar coordinates instead of (u, v). Then 
the change of variables formula is 

£f(x,y)dxdy = X /(r cos 0, r sin 0)r drd$. 


Example 6.20. Let Z) = {(x,y) : x 2 +y 2 <25} and let C denote the rectangle 
0 < u < 5, 0 < v < 2 n. See Figure 6.29. We evaluate 


L 


X+r 


dA. 


The polar coordinate mapping F(i/, v) = (i/ cos v, u sin v) is not a smooth change 
of variables on a set that contains C: detDF(0,0) = 0 and also F is not one to 
one, F(w,0) = F(u,2n). For 0 < e let 


C e = {( u , v): e < u < 5, 0 < v < In - e] 

and set D e = F(C e ). Then the change of variables theorem applies. The inte- 

2 2 

grand e* +}; is bounded on D and the area of the region C-C e tends to zero 
as 6 tends to zero, as does the area of the region D-D e . Therefore as e tends 
to zero, the relation 


f Q xl+yl dxdy = f e“ 2 
JD e JC e 


ududv 


tends to 


f e* 2+y2 dxdy = f e ? u 
JD Jc 


du dv. 


We evaluate the last integral as an iterated integral 

^v=2n / r*u=5 


r^v-ln / r*u=D 

J y-0 \Ju=0 


e M udu \ dv 


v=2tt v -,w=5 
& 


nV-Ln r 

f !■ 

Jv =0 L 


u =0 


dv = i(e 25 - l)27r. 


Example 6.21. Let D be the parallelogram in R 2 bounded by the lines 
y = -x + 5, y = -x + 2, y = 2x-l, y = 2x-4. 
and let R be the unit square [0,1] x [0,1]. See Figure 6.30. Compute 


□ 
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Fig. 6.30 The region in Example 6.21. 


X 


e x dA. 


The mapping F (u,v) = (u + v + 1, -u + 2v + 1) takes R one to one onto the 
parallelogram D. 


D¥(u,v)- 


1 1 
-1 2 


, JF(u,v) = det(ZJF(z/,v)) = 3. 


So 


f e x dxdy= f e u+v+1 \3\dudv = 3e f f e u e v dudv 
Jd Jr Jo Jo 

= 3 e f e M dw f ( 

Jo Jo 


e v dv = 3e(e- 1) . 


Problems 

6.32. Consider the mapping 


(a) Show that the Jacobian is 1. 

(b) Sketch the triangle with vertices (1,0), (1,1), and (0,1) and its image under the 
mapping. Show that its image has the same area. 




1 1 

u 

_T_ 


0 1 

V 


6 . 33 . Figure 6.31 shows a triangle whose sides are multiples of h. The size of the 
triangle has been changed by moving one side so that one edge is changed by kh 2 
for some k. Use the steps below to show that the change in area is less than or equal 
to some constant times /z 3 . 
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2 

h k 



Fig. 6.31 The triangles in Problem 6.33. 


(a) Show that t = k\h 2 for some k \. 

(b) Show that m = + & 3 h 2 for some k^,k 3 . 

(c) Show that the change in area is less than some multiple of h 3 if h is sufficiently 
small. 


6 . 34 . For a mapping of the form (x,y) = (/(w),g(v)), verify that the Jacobian is given 
by 


det 


x u x v 
y u Tv 


/W(v). 


6.35. Consider the mapping defined by the complex square 

x + \y = (u + iv) 2 , 


that is, x = u 2 - v 2 , y = 2 uv. It maps the quarter annulus shown in Figure 6.32 to 
the half annulus, for example. Find the Jacobian at the point (1,0), and sketch the 
approximate image of the small indicated triangle, that has vertices at ( 1 , 0 ), ( 1 . 1 , 0 ), 
and (l,.l). 




Fig. 6.32 The mapping in Problem 6.35. 


6 . 36 . Use the change of variables (v,y) = (wcosv, wsinv) to evaluate the integrals, 
where U is the unit disk centered at the origin. 

(a) yf^+f-dA, 

(b) f (3 + x 4 + 2x 2 y 2 +y 4 )dA, 

Ju 
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JU 


6.37. Use the change of variables (v,y) = (rcos#, rsin0) to polar coordinates to eval¬ 
uate the integral 


Jd 

where D is the set of points (v,y) with a 2 < x 2 + y 2 < b 2 . a,b are positive constants. 


6.38. Use symmetries or the change of variables (v,y) = (rcos0,rsin0) to evaluate 
the integrals, where D is the annulus 




6.39. Verify that each of the following formulas defines a mapping of the unit square 
C to the rectangle D shown in Figure 6.33. In each case, make a sketch that shows 
your initials written in C, and their image in D. 

(a) (x,y) = (5m, 3v) 

(b) (jc,y) = (5v,3M) 

(c) (x,y) = (5v,3-3w) 


y 


3 


v 


D 


C 


U 


X 


0 


0 


5 


Fig. 6.33 The sets in Problem 6.39. 


6.40. For each mapping in Problem 6.39, 


(a) calculate the Jacobian det Xu Xv , and 

yu Tv 

(b) state whether the mapping is orientation preserving or not. 
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6.41. Substitutions are made in the integral I (1 + xy)dxdy using the mappings in 

JD 

Problem 6.39 and their Jacobians from Problem 6.40. Verify the formulas: 


(a) 

| (l+xy)dxdy = 

| (1 + I5uv)15dudv. 


JD 

r* 

JC 

r* 

(b) 

I (l-\-xy)dxdy ^ 

(l + 15vM)(-15)dMdv. 


JD 

r* 

JC 

r * 

(c) 

I (l+vy)dvdy = 

JD 

J (1 + 15v(l - u))\5dudv. 


6.5 Integration over unbounded sets 

To integrate a function / over an unbounded set we will require that the unbounded 
set D is the union of an increasing sequence 


D\ c T >2 c T>3 c • • • 


of smoothly bounded sets. 

We start by giving some examples. 

Example 6.22. ( a) The first quadrant D , consisting of all points ( x,y ) with v 
and y nonnegative, is unbounded. Let D n be the square 0 <x<n, 0 <y<n. 
See Figure 6.34. Quadrant D is the union of the sets D n . 

(b) Let D again be the first quadrant and let D n be the points where v > 0, y > 0, 
and v + y < n. Again D is the union of the D n . See Figure 6.34. 

(c) D is the half plane consisting of all points (v,y) with v > 0, and D n is the 
rectangle 0 <x<n, -n<y <n. 

(d) D is the whole plane and D n is the square —n< x<n, -n<y <n. 

(e) D is the whole plane and D n is the disc of radius n centered at the origin, 

x 2 +y 2 < n 2 , 



Fig. 6.34 The first quadrant is the union of increasing squares or triangles. 

Integration of a continuous function / over an unbounded set D is defined by 
considering the limit of a sequence of integrals of / over smoothly bounded sets. 
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Definition 6.9. Let D be an unbounded set in R 2 . Denote by D{n ) the set of 
points in D whose distance from the origin is less than or equal to n. See 
Figure 6.35. A nonnegative continuous function / defined on D is integrable 
over D if the sequence of numbers 



n - 1,2,3,... 


converges. The limit of this sequence is called the integral of / over D. We 
say that the integral exists , and write 


f fdA = lim f fdA. 

JD n ^°° JD(n) 



Fig. 6.35 The sets D(n) engulf the first quadrant D. The sets D( 4), D( 5), and D( 6) are indicated. 


We show now that we may replace the sequence of subsets D(ri) by any increasing 
sequences of bounded subsets D n whose union is D : 


Theorem 6.13. Let D be an unbounded set, and 


D\ c T >2 c £>3 c ••• 


an increasing sequence of smoothly bounded subsets ofD whose union is D, 
and let f be a nonnegative continuous function on D. Then 


lim 

m —>oo 



fdA = lim 

n —KX3 



fdA 


in the sense that if one limit exists, so does the other, and the limits are equal. 
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Proof. Suppose first that lim I fdA exists. For a nonnegative continuous func- 

m—>oo n 
L) m 

tion / both sequences of integrals above are increasing. We observe that for each 
D{n ), all subsets D m for m large enough contain D{n)\ it follows that 


lim I fdA > I fdA. 

m ~ > °° JDn, JD(n) 


Therefore by the Monotone Convergence Theorem 


lim f 

n ^°°JD 

lim f fdA > lim f fdA. 


fdA exists and 


D(n ) 

Now since the sets D m are bounded, all subsets D(ri) for n large enough contain D m . 
From this the opposite inequality follows. Therefore the two limits are equal. The 
same argument applies if we reverse the roles of D n and D(n). □ 


Here is an example of an integral over the whole plane: 


r 

JR2 


2 2 

e -x dA. 


( 6 . 21 ) 


We evaluate this integral over the plane in two different ways. 

(i) Choose as the sequence D n the squares —n < x < n, -n<y <n. According to 
Theorem 6.11 


/ e • - d , n r 

OD n J -n \J-n 


e x2 q };2 dv]dy= C e y2 dy f 
J-n J-n 


e - * 2 dx. 


The two integrals on the right, one with respect to x, the other with respect to y, are 
equal. Therefore the right side is equal to 


f 


e x2 dx\ . 


Does this sequence of numbers converge? For v > 1 we have 0 < e x < e x and 

f oo f co ^ 

since I e~ x dx exists, I e _x dx exists as well. By symmetry 


f 


So I e dx converges to I e dx, and 


r e-=d>= r 

J —OO Jo 

f 


e xl dx. 


r e -^44=(r 

Jr 2 \J-o 


e x dx 


( 6 . 22 ) 
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exists. Next we find its numerical value. 

(ii) Choose as the sequence D n the discs x 2 +y 2 <n 2 . In the integral over D m 
make a change to polar coordinates x = rcos6 , y = rsin#. As we saw in Exam¬ 
ple 6.20, when we make the change of variables we get 


r e ~ x2 - y2 d a = r 

*Jd„ Pn 


rdrdO 


where P n is the rectangle 0 < r < n, 0 < 6 < In. 

This integral can be carried out explicitly; it is equal to 


X" (f de Y r2 rdr= - e "” 2) ' 


The limit is n as n tends to infinity. This shows that 
(6.22) we conclude that 



dA = 7i. By equation 



dx = 


(6.23) 


It is curious that we have found the value of the integral (6.23) with respect to a sin¬ 
gle variable by finding the value of the integral (6.21) with respect to two variables. 

We remark that the evaluation of the integral (6.23) is basic in the theory of 
probability. The graph of the function 


m = -L e ~ x2 

y7T 

is the classic bell curve, of the normal distribution. Probability is an important appli¬ 
cation of integration over unbounded sets. 

Not all functions describing probabilities are continuous on all of R 2 . So we 
extend the concept of integration over an unbounded set D to functions that are not 
continuous on D. 


Definition 6.10. Let D be an unbounded set in the plane. Let / be a bounded 
nonnegative function on D that is integrable on D(m) for some m (as described 
in Definition 6.9) and is continuous on D - D(m). We say that / is integrable 
on D if the sequence of integrals 

f /dA, f /dA, f /dA, ... 

JD(m ) JD(m+ 1) JD(m+2) 

converges. We denote this limit 


lim 

n^oo 



/dA = 


X 


/dA. 
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Definition 6.11. We say that a function p is a probability density function if 

(a) p(x,y) > 0 for all ( x,y ) in R 2 , and 

(b) f p(x,y)dA=l. 

Jr 2 


If p is integrable on a set D then the probability that (v,y) is in D is 

I p(x,y)dA. 

Jd 


Example 6.23. Show that 

P(x,y) = te~ (x2+y2) 


is a probability density function, and find the probability that (v,y) is in the 

first quadrant. We showed that I e~ x2 ~ y2 d A = n, so 

Jr 2 

f ie- (jc2+ y 2) cL4 = 1. 

JR2 71 ' 

Since p is nonnegative, p is a probability density function. The probability 
that (x,y) is in the first quadrant is 


/ 


lp— i^+y 2 ) 


d A = lim 


x>0, ;y>0 


J r*7r/2 r*n 

„ 


rdrd^ = f- 

Z 7T 


lim 



_ l 
“ 4* 


We might also have reasoned that by symmetry the integral of p on each of 
the four quadrants is equal; therefore the probability is □ 

In the next example p is not continuous on R 2 but it is integrable on R 2 . 

Example 6.24. Let 


p(x,y ) = 


' 2 x+c-y 

^ ~ 


0 < v < 1 and 0 < y < 2 
otherwise 


Find c so that p is a probability density function, p is not continuous on R 2 
but by Definition 6.10 p is integrable over R 2 . The integral of p over R 2 is 


f pdA= f 

Jr 2 Jd 

=ff 


D=[0,l]x[0,2] 
2 x + c-y 


pdA + 


r 

Jr 2 


pdA 


-D 


dxdy + 0 


-f 2 '- 


+ c — y 2 + 2c — 2 
— A - dy =--- 


c 

2 ' 


So c — 2. We check that p is nonnegative: in the rectangle 0<v<l, 0<y<2 
we have v > 0 and 2 - y > 0, so 
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p(x,y) = 


2x + (2 - y) 
4 


> 0 . 


□ 

We end this section by extending the notion of integrability in an unbounded 
domain to continuous functions that take on both positive and negative values. 


Theorem 6.14. Let D be an unbounded set, and f a continuous function on D 
whose absolute value is integrable over D, 



exists. 


Let D n be an increasing sequence of smoothly bounded subsets of D whose 
union is D, 

Di c D 2 c • c D n c •••, U n D n = D. 


Then the limit of the integrals of f over the sequence D n 


lim f 

n ^°° JD r 


fdA exists, 


and this limit is the same for all such sequences D n . 


Proof Decompose / as the difference of its positive and negative parts, / = /+-/_, 
where 

f+(x 9 y ) = max(/(v,y),0), f-(x,y) = max(-/(*,y),0). 

The absolute value of / is the function 


I/I = /++/-. 


This shows that the nonnegative functions /+ and /_ are less than or equal to |/|. 
Since \f\ is integrable over D , so are f+ and /_ (see Problem 6.50). 

According to Theorem 6.13, the integral of /+ and /_ over D is the limit of the 
integral of /+ and /_ over any increasing sequence of smoothly bounded subsets D n 
of D whose union is D. It follows that the limit of the integral of their difference 
/+ - /_ over D n is the difference of the limits of the integral /+ and of /_ over the 
sequence of subsets D n . Since /+-/_= /, this completes the proof of Theorem 6.14. 

□ 


Based on Theorem 6.14 we make the following definition. 

Definition 6.12. A continuous function / that takes on both positive and neg¬ 
ative values in an unbounded set D is called integrable over D if its absolute 
value |/| is integrable over D. 
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Problems 

6.42. Describe the sets in which the functions below are positive. 

(a) f(x,y) =l-x 2 

(b) f(x,y) = xy 

(c) /(x,y) = cos(27t a/x 2 + y 2 ) 

6.43. Let D be the region in R 2 where 0 < x and 0<y<l- 


(a) Evaluate I e X dxdy 
JD 

(b) Evaluate | e _x ^dxd y 

J/)_ 

(c) Show that e - ^ is not integrable over D. 


6.44. Let a > 0,b > 0. Use the change of variables theorem to show that 


X 


x 2 +y 2 <n 2 


-X 


' dAd - V= L 2 V 2 2 e 

t^r + Y u<n 2 




-(U z +v z ) 


A fab 


dudv , 


and evaluate 


r 

Jr 2 


3 -(ax 2 +Z?3; 2 ) 


dA. 


6.45. Let C7 be the open unit disk in R 2 centered at the origin, ^/x 2 +y 2 = r < 1. 
Which of these functions are integrable over R 2 -U1 

(a) logr 

(b) i 

< C) ;5I 

(d) r“ 3 


6.46. Verify the following identities for the positive and negative parts of numbers 
x and functions /. 

(a) |x| = x+ + x_ 

(b) x = x+ - x_ 

(C) 1/1 = /++/- 

(d) / = /+-/- 

/-»oo 

6.47. Evaluate | e“^ x dx by a change of variables. 


X o 

o 

f 


6.48. Evaluate I e x dx by a symmetry argument. 


6.49. Show that if / is continuous then the positive part of /, /+, is continuous. 
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6.50. Justify the following steps to prove that if / is integrable on R 2 and g is a 
continuous function with 0 < g < f then g is integrable on R 2 . 


/ 

Jdc 


(a) 

JD(n) 

(b) 0 < 


gdA exists. 


f gdA< f j 

JD{n) JD(n) 

f 

Jdc 


JD(n) 

(c) The numbers 


(d) lim 


im f * 

^°° JD{n ) 


fdA. 


)D(n) 
gdA exists. 


gdA are an increasing sequence bounded above. 


6.51. Let p be the probability density function in Example 6.24. 

(a) Sketch the set in R 2 where x + y < 2. 

(b) Find the probability that x + y < 2. 


6.52. Let p(x,y ) = 2x when 0 < x < 1 and 0 < y < 1, and p(x,y) = 0 otherwise. 

(a) Show that p is a probability density function. 

(b) Sketch the set in R 2 where x > y. 

(c) Find the probability that x > y. 

6.53. Let p be a probability density function in R 2 . Each point of R 2 is either in a 
set D or not in D. What does the equation 

| pdA-\- I pdA= I pdA 
Jd Jr 2 -d Jr 2 

tell you about the probability that (x,y) is not in D1 

6.54. Suppose / and g are continuous functions on R 2 so that / 2 and g 2 are inte¬ 
grable on R 2 . Use the inequality 


lab <a 2 + b 2 

to show that the following functions are also integrable: 

(a) \fg\ 

(b) fg 

(c) (f + g ) 2 


6.55. Show that / 2 is integrable on R 2 for each function /: 


(a) 


(b) e 

(c) 


1 +r A 


, where r = -y/x 2 +] 


M 


1 + r 4 
(d) ye"'' 
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6.56. Justify steps (a)-(d) to prove that if a continuous function / from R 2 to R is 
integrable on an unbounded set D then 


I f fdA \- f 

\Jd I Jd 


I/I dA. 


(a) 

f fM= f 

/+ dA - 


Jd Jd 

(b) 

(-/)cL4< 

I/I dA 


Jd 

Jd 

r * 

(c)- 

- fdA< 

I/I dA 


Jd 

P l / 

Id 

r» 

(d) 

JLH s J 

I/I dA. 

D 


I/I dA 


6.6 Triple and higher integrals 


We outline key definitions, state theorems, and look at many examples of integrals 
of functions of three or more variables. 

The notion of a smoothly bounded set in R n is defined inductively on the dimen¬ 
sion n. 


Definition 6.13. A closed set D in R n is smoothly bounded when its boundary 
is the union of finitely many graphs of C 1 functions of the form 

Xk = g(xi,...,Xk~i,Xk+i,...,x n ) some k=l,...,n 

defined on smoothly bounded sets in the hyperplane where = 0. 


The notions needed to define the volume of a smoothly bounded set in R n are 
analogous to those we used to define the concept of area of a smoothly bounded set 
in R 2 . In a similar manner we define the lower and upper volumes and say that a set 
has volume Vol (D) if its upper and lower volumes are equal. The following theorem 
is analogous to Theorem 6.4. 


Theorem 6.15. Smoothly bounded sets in R n have volume. 


The definition of the integral of a function of n variables mirrors the definition of 
the integral of a function of two variables. 
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Definition 6.14. Let / from R n to R be a bounded function on a smoothly 
bounded set D in R n . We define the upper and lower integrals 

Iu(f,D), I L (f,D ) 

for bounded nonnegative functions as we did in R 2 . If the upper integral equals 
the lower integral we say / is integrable over D , and we write the integral 

I u (f,D) = I L (f,D)= f f d n X. 

Jd 


Just as we saw for functions from R 2 to R, we can show that continuous functions 
are integrable over smoothly bounded sets. 


Theorem 6.16. For a continuous function f on a smoothly bounded set D in 
R n , the upper integral of f over D is equal to the lower integral, and the 
integral off exists: 

Iu(f,D) = I L (f,D)= f fd n X. 

JD 


We have the four properties of linearity, boundedness, and additivity: 


Theorem 6.17. For all continuous functions f and gfrom R n to R on smoothly 
bounded sets C and D, and all numbers c, 

(a) f (/ + g)d n X = f fd n X+ f gd n X, 

JD JD JD 

(b) f cf d n X = c f f d n X, 

Jd Jd 

(c) If l < /(X) < u then l\o\(D) < f fd n X<u\o\(D). 

Jd 

(d) IfC and D are disjoint or have only boundary points in common and CUD 

is smoothly bounded then f f d n X= f fd n X+ f fd n X. 

JcuD Jc Jd 


Example 6.25. The constant function f(x \,..., x n ) = 1 is continuous on R n and 
we have 1 < /(X) < 1. It is integrable over every smoothly bounded set D. By 
the boundedness property, Theorem 6.17 part (c), 

Vol (D) < f ld"X< Vol(D). 

JD 





6.6 Triple and higher integrals 


263 


Therefore Vol(Z>)= f ld n X. □ 

Jd 

The integral is characterized by the four properties in Theorem 6.17; we have the 
following theorem. 


Theorem 6.18. Suppose a number D) is defined for every continuous 
function f on a smoothly bounded set D in R n and that it has the following 
properties. 

(a) S(f + g,D) = D) + J?(g, D ) 

(b) J?(cf,D) = cjT(f,D)for every number c 

(c) Ifb < f(x\,X 2 , • • • ,x n ) < u for all (x\,X2,...,x n ) in D, then 

€Yo\(D) < J?(f,D) < mVoI(D), 

(d) For all pairs of smoothly bounded sets C and D that are disjoint or have 
only boundary points in common, 

S(f,CuD) = S(f,C) + S(f,D), 


Then 

S(f,D)= f f d n X. 

JD 


The proof is similar to the proof of Theorem 6.10 for double integrals. 

As with double integrals we often calculate integrals using approximate integrals 
(Riemann sums). 


Theorem 6.19. The integral | fd n X of a continuous function f on a 

JD 

smoothly bounded set D in R n is equal to the limit as h tends to zero of approx¬ 
imate integrals 

'Zf(?j)h n 

j 

that are sums over h-boxes in D, with any chosen points P ; - in the j-th box. 


Average of a function. The integral allows a definition of the average of a function. 

Definition 6.15. The average 

bounded set D is 

1 

Vol(D) J D J f D ld n * 


of an integrable function over a smoothly 




r j/1v 


f D fd"X 
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The next theorem gives conditions under which the average of / is a value of /. 
We say that a set D in R n is connected if for every two points A and B of D , there is a 
continuous function X from some interval [<a , b] to D so that X(a) = A and X(b) = B. 


Theorem 6.20. Mean Value Theorem for Integrals. Suppose f from R n to 
R is a continuous function on a smoothly bounded connected set D in R n , with 
Vol ( D ) ^ 0. Then there is a point P in D at which 


/(P) = 


f D f d " x 

Vol (D) ' 


Proof By the Extreme Value Theorem there are points A and B in D where / has 
its minimum, m, and maximum M, on D, 


m = /(A) < /(X) < /(B) = M. 


By the boundedness property of integrals 


/(A)< 


J D /(X)d"X 
Vol (D) 


</( B). 


Since D is connected there is a continuous function X from some interval [< a , b] to 
D with X(a) = A and X(b) = B. Thus 


/ o X(a) < 


J D /(X)d”X 
Vol (D) 


<foX(b). 


Since / o X is continuous, it follows from the Intermediate Value Theorem that there 
is a number c between a and b where 


f °X(c) 


f D f(X) d"X 
Vol (D) 


Take P = X(c). 


□ 


We focus in the rest of this section on integrals of functions of three variables. 

Triple integrals. The integral of a continuous function / on a smoothly bounded 
set D in K 3 is called a triple integral and is denoted 

f fdV. 

Jd 

Example 6.26. Let f(x,y,z ) = x 5 and let D be the set 1 < x 2 +y 2 + z 2 < 4. Find 


X 


dV. 
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We observe that f(-x,y,z) = -f(x,y,z) and D is symmetric about the y,z 
plane. We form approximate integrals ^/(P 7 )/z 3 by choosing symmetric 

j 

points (~Xi,yi,Zi) and in the h- boxes of D on opposite sides of the 

y,z plane. If we number the h- boxes on the v > 0 side of the y,z plane from 
i= 1 to TV, these approximate integrals cancel 

N 

Y J ((x i ) 5 h 3 + (-x i ) 5 h 3 ) = 0 
i= 1 


for all h. Therefore | v 5 dV = 0. 

Jd 

Example 6.27. Let D be the set 1 < v 2 +y 2 +z 2 < 4 and let 
f(x,y,z ) = sin(z 3 ) + v 5 + 200. 


We saw in Example 6.26 that I r 5 dV = 0. Similarly since sin(z 3 ) is odd and 

Jd 


D is symmetric about the x,y plane we have that I sin(r )dV = 0. Since the 

Jd 

integral of the constant function 200 over D is 200Vol(Z)), 


X 


l<x 2 +y 2 +z 2 <4 


(sin(z 3 ) + v 5 + 200) dV = 0 + 0 + 200 Vol (D). 


The volume of D is the volume inside the sphere x 2 + y 2 + z 2 = 4 minus the 
volume inside the smaller sphere x 2 +y 2 +z 2 = 1, so Vol(D) = |tt( 2 3 - l 3 ), 
and 


X 


l<x 2 +y 2 +z 2 <4 


(200 + v 5 + sin(z 3 )) dV = 


5600 


71. 


□ 


Example 6.28. Let D\ be the set of points (x,y,z) in R 3 where x 2 +y 2 +z 2 < 4, 
let L >2 be the set where 4 < x 2 +y 2 + z 2 < 9, and let 


f(x,y,z) 


j2 (x,y,z) in D l 

|l (x,y,z) in D 2 . 


Then D = D\ U D 2 is the solid ball of radius 3 centered at the origin. The 
average value of / on D is 

LfdV = f Dl 2dv + f D2 ldv = 2Vol(Di)+ Vol(D 2 ) = 35 
Vol (D) ~ Vol (D) " Vol (D) ~ 27 ' 


□ 
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Similar to the case of double integrals, triple integrals can sometimes be com¬ 
puted by computing three iterated integrals. Suppose that D is the set of points in 
M 3 that satisfy 

gi(x,y) <z< g2(x,y), (x 9 y) in D ^ 

where D x y is a smoothly bounded set in the x,y plane and g\ and g2 are differen¬ 
tiable. See Figure 6.36. Then D is a smoothly bounded set in R 3 . If / is continuous 

on D , If dV exists. For each (x,y) in D xy let D(x,y ) be the interval of points 

Jd 

(x,y,z) with 

gi(x,y)<z<g2(x,y). 

See Figure 6.36. Integrate / with respect to z from g\(x,y) to g2(x 9 y) 9 

r rz=g 2 (x,y) 

I f(x 9 y,z)dz= I f(x,y,z)dz. 

JD(x,y) Jz=g\ (x,y) 



Fig. 6.36 A set D is between graphs, gi(x,y) < z < gi(x,y). 


The result is a continuous function of (x,y) that we can integrate over D^. By 
an argument similar to the argument for double integrals we can show that iterated 
integrals satisfy the four basic properties of linearity, boundedness, and additivity. 
Therefore by Theorem 6.18 the iterated integral equals the triple integral, 


r / rz=g 2 {x,y) \ r 

f(x,y,z)dz\dA = 

JDxy \j-=g\(x,y) / Jd 


fdv. 


If Dxy is y simple, with a < x < b and c(x) <y< d(x ), where c and d are C 1 
functions, see Figure 6.37, then we can compute the triple integral by 


r r*=b / ry=d(x) / rz=g 2 (x,y) \ \ 

fdV= f(x 9 y 9 z)dz]dy] 

JD Jx-a \Jv=c(x) \Jz=gi(x,y) / / 


dx. 
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Fig. 6.37 Dxy is >7 simple. 


Example 6.29. Let D be the set in the first octant that is bounded by the surface 
z = x 2 +y 2 and the plane z = 4, and define f(x,y,z ) = x. Find 

f fdV. 

Jd 

Figure 6.38 shows a sketch of D. is the quarter disk in the x,y plane, 
bounded by the x and y axes and the graph of y = V4-x 2 . 

r xdv = r (r xd 

Jd Jd^ \J z =x 2 +y 2 ) 


Since D xy consists of the points (x,y) with 0 < x < 2 and 0 < y < 
have 


L xdv - L wi v 44 =I„ 4(4 _y2)d4 


V4-x 2 we 



-r)dy 


dx = 


J ^x-2 


■*V 


vi:“ '.h 


px=2 

= f ! 

Jx=0 


\x(4 - x 2 ) 2 ^ 2 dx = -^(4-x 2 )' 


.2x5/2 


x=2 


x=0 


- 64 
15- 


□ 



Fig. 6.38 The set £) in Examples 6.29 and 6.30. 
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Example 6.30. Let D and f(x,y,z ) = v be as in Example 6.29. Now view D 
as a smoothly bounded set between the graphs of the functions of x and z 
given by y = 0 and y = Vz- x 2 over the smoothly bounded set D xz indicated 
in Figure 6.38. 



□ 

Change of variables. The change of variables theorem for double integrals, The¬ 
orem 6.12, has an analogue for triple integrals and integrals of functions of n vari¬ 
ables. 

A smooth change of variables F is a continuously differentiable mapping from 
an open set U in R n to R n that is one to one on U and DF(P) is invertible for each 
point P in U. We recall Definition 3.4 of the Jacobian of a mapping at point P, 
/F(P) = det(DF(P)). 


Theorem 6.21. Let F be a smooth change of variables that maps a smoothly 
bounded set C onto a smoothly bounded set D, so that the boundary of C is 
mapped to the boundary of D. Let f be a continuous function on D. Then 


f/(X) d"X = J^/(F(U))|7F(U)|d"U. 


In some of the following examples of the change of variables we make tacit use 
of limiting arguments similar to the extension of polar coordinates in Example 6.20. 

Example 6.31. Find the integral of f(x,y, z)-z yjx 2 +y 2 over the region D that 
is bounded by the graphs of z- 6, z = 0, x 2 +y 2 = 4, and x 2 +y 2 = 1, and 
where v > 0. See Figure 6.39. The iterated integral in rectangular coordinates 
is ^ 

^ ^ zJx 2 +y 2 dz | dxdy = f 18 Jx 2 +y 2 dxdy 

Jd^ \ Jz=o y ) JDxy y 

where D xy is the region where 1 < x 2 + y 2 < 4 and v > 0. The double inte¬ 
gral involves integrating sjx 2 +y 2 with respect to v or y. Using a change of 
variables to cylindrical coordinates, v = rcos#, y = rsin#, z = z the Jacobian is 
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Fig. 6.39 The region D in Example 6.31. 


det 


cos 6 -rsin# 0 
sin# rcos#0 
0 0 1 


= r(cos 2 6 + sin 2 6) = r. 


The map F(r,#,z) = (v(r,#,z),y(r,#,z),z(r,#,z)) takes the rectangular box C 
given by 1 < r < 2, 0 < # < 7r, 0 < z < 6 one to one onto D. By the change of 
variables theorem 


I f(x,y,z)dxdydz 
Jd 


£ 


f(F(r, 6 , z)) rdrd#dz. 


Since x 2 +y 2 = r 2 , /(F(r,#,z)) = zr and the integral is 


f fdxdydz = P ( f If zr 2 dr)d#)dz 
JD Jz =0 \Je= 0 \Jr=l / / 


r*Z=6 n6=7T r*r= 2 

= I zdz I d# I r 2 dr = ( 18 )( 7 t)(^) = 427 T. 
Jz-0 Jo= 0 *2r-l 


Here is an example why we need the mapping F to be one to one. 


□ 


Example 6.32. The mapping F(r, #, z) = (r cos #, r sin #, z) maps the points in the 
rectangular box C given by 


l<r<2, 0 < # < 47 t, 0 < z < 6 


onto the region D between the two cylinders shown in Figure 6.40. The vol¬ 
ume of D is 7t(2 2 - 1 2 )6 = 187T. If we were to mistakenly use the change of 
variables formula to compute the volume of D we would get 

Vol ( D) = J" 1 dx dy dz =? J* |/F(r,#,z)|drd#dz. 


The right hand side is 
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Fig. 6.40 The region in Example 6.32. 


r* n 2 rAn r*6 

= I rdrdOdz = I rdr I d0 dz 

Jc Ji Jo Jo 

= i(2 2 -l 2 )(47r)6 = 367r. 


What went wrong? Even though IF(r,6,z) = 0 at each point, the mapping 

is not one to one on a set that has positive volume. The points with 2n< 6 <4n 
are sent to the same points in the range as those with 0 < 6 < 2n. □ 



Fig. 6.41 The energy density / in Example 6.33 increases toward the bottom of the tank. 


Example 6.33. A chemical reaction occurs in a tank D given by 

x 2 +y 2 < 4, 0<Z<3. 

Suppose the energy density [energy/vol] is f(x,y,z) = 100 - 5z at each point 
(x,y 9 z) in D. Find the total energy and the average energy density in the tank. 
The total energy is 


} =2n pr~2 / pz~2 \ 

(100-5z)dz rdrdfl 
=o / 


f fw= r 27r r 2 ( r 

JD Jd= 0 Jr= 0 \Jz= 

= f d6 f rdr f (100-5z)dz = (2^)|[- J^QOO-Sz) 
Je= 0 Jr= 0 Jz= o 


z =3 


z=0 


- j§(100 2 - 85 2 ). 
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The volume of the cylindrical tank is 7r(2 2 )(3) = 127T, so the average energy 
density in D is 



Fig. 6.42 Left: The prism C, Right: the tetrahedron D in Example 6.34. 


Example 6.34. Let C be the prism in R 3 given by 

0<w<l, w + v< 1, u > 0, v > 0. 

Let D be the tetrahedron given by 

* + y + z< 1, v > 0, y > 0, z > 0. 

See Figure 6.42. The function (v,y,z) = F (u,v,w) = ((1 -w)u,(l -w)v,w) maps 
the interior of C one to one onto the interior of D. The Jacobian is 


detDF(w,v,w) = det 


l-w 0 —u 
0 l-w v 
0 0 1 


= (l-w) 2 . 


By the change of variables theorem, 


Vol (D) 


= f idv = r (i- 

Jd Jc 


w) 2 dudvdw. 


Jd Jc 

As an iterated integral this is equal to 
r / r*w= 1 \ f r iw= 1 r 

(1 -w) 2 dw\dA = -i(l-w) 3 dA=i 

JC UV \ Jw= 0 / Jc uv L Jw =0 Jc, 


d4=II = I. 


Spherical coordinates. Let 


(v,y,z) = F (p.(p,6) = (p sin 0 cos 6,p sin0 sin 6,p cos0) 
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be the mapping between rectangular and spherical coordinates, p > 0, 0 < 0 < n, 
0 < 6 < 2n. F maps regions in (p,0,#) space to (x,y,z) space. 


DF(p.<f>,8) 


sin0cos 6 pcos0cos# -p sin0sin# 
sin0sin# pcos0sin# psin0cos# 
cos 0 -p sin0 0 


We ask you in Problem 6.62 to verify that 

/F(p,0,#) = det£>F(p,0,#) = p 2 sin0. 


Note that for p = 0, 0 = 0 and 0 = 7r the matrix derivative is not invertible and F is 
not one to one when # = 0 and 2n. The change of variable formula can be proved by 
taking the limit as p tends to zero, 0 tends to zero or n, and # tends to 0 or 2 n. See 
Example 6.20 for an analogous argument for polar coordinates. 

Example 6.35. Take D to be the region between the spheres of radius a and 
b centered at the origin. Find the integral of f(x,y,z) = ^x 2 +y 2 +z 2 over 
D. Use the spherical change of coordinate mapping F(p,0,#) that maps the 
rectangular box C given by a <p <b, O<0<7r, 0<#<27 t onto D. By the 
change of variables theorem, 


fj 


x 2 + y 2 + z 2 dxdydz = 


p 2 p 2 sin0dpd0d# 



Fig. 6.43 The cone region D in Example 6.36. 


Example 6.36. Let D be the cone-shaped region in (x 9 y,z) space shown in 
Figure 6.43. D is described in spherical coordinates by0<p<5, 0 < 0 < |, 
0 < # < 2n. The volume of D is 
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r* r*2n rw/ 6 r* 5 

= dxdydz = 

JD 3 6 =0 J(p= 0 Jp - 


Vol (D) = dxdydz 

JD 


'=OJ0=O Jp=0 

■27t p*7r/6 


p 2 sirupdpdfdO 


p2ic / r>n/6 / r>5 \ \ p27T p; 

= I I I p 2 dp sin0d0 d0 = I d0 I 

Jo=0\J<f>=0 \Jp =0 / / J6>=0 J0= 

= (2^)[-cos0j ;r/6 (i5 3 ) = (2jr)(l-#) 1(125). 


sin0d0 I p z dp 

0=0 Dp-0 


Example 6.37. A function / gives the electric charge density [charge/volume] 
inside the unit sphere p = 1 as 

/ (p, 0) = p 2 sin 2 0. 

Find the total charge inside the sphere. 

^2n r*u r*\ 


f fdv= r r r 

3 p< l 30—0 3(p —o 3p ~o 


p 2 sin 2 0p 2 sin0dpd0d# 


p*27T nil n 1 

= I sin 2 #d# I sin0d0 I p 4 dp = 
J6/=0 J0=O Jp=0 


*(2)U) 


1\ _ 2?r 


Variation with a parameter. Often there is a fourth variable in a triple integral. If 
f(x,y,z , 0 is the energy density at (x,y,z) at time t, then the total energy in D at time 
t is 

I f(x,y,z,t)dxdydz, 

3d 

a function of t. What is the rate of change of the total energy in D with respect to 
time? 


Theorem 6.22. Suppose f is a continuous function on Dx [a,b] where D is a 
smoothly bounded set in R 3 . 


(a) Then I f(X,t)dV is a continuous function of t. 

JD 

(b) If f is continuously differentiable in t then 


f f(X,t)dV 
3d 

is a continuously differentiable function oft and 


sl /<x ' ,,dv 


I 


m.ndv 
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Proof, (a) 


f f(X,t)dV- f f(X,s)dV= f (f(X,t)-f(X,s))dV. 
Jd Jd Jd 


According to Theorem 2.12 a continuous function on a closed bounded set is 
uniformly continuous; so for every positive e there is a 6 such that 


\f(X,t)-f(X,s)\<e 


for \t - < S and all X in D. Then for \t - s\ < S the integral of /(X, t ) and the 

integral of /(X, s) over D differ by less than 6Vol(D). 

(b) Continuously differentiable in t implies that for every e > 0 there is a S such that 


/(X,x + /Q-/(X, t) 
h 


~ft(Xj) 


< 6 


(6.24) 


for \h\ < S for all X in D. Here is why: f t is uniformly continuous, so given e there 
is S so that if \t - s\ < e then | f t (X, t) - f t (X , s)\ < e for all X. Take \h\ < S. By the 
Mean Value Theorem 


/(X,f + A)-/(X,f) 


h 


-f,(X,t ) 


= |/ t (X,c)-/ t (X,o| 


< 6 


where \c-t\ < \h\ < 6. 

It follows from estimate (6.24) that as h tends to zero 


f D f(X,t + /i)dV- f D f(X,t)dV _ r f(X,t + h)-f(X,t) 
h Jd h 


converges as h tends to zero to 
part (a). 


I f t (X,t)dV. This derivative is continuous by 

JD 


□ 


Example 6.38. Suppose now that the chemical reaction of Example 6.33 has 
energy density f(x,y,z,t ) = e _? (100 - 5 z) that depends on time. Find the rate 
of change of the total energy in the tank. The total energy is 


/ fdv= I 


e~ t2 (100-5z)dV. 


We showed that | (100- 5z)dV = yg(100 2 - 85 2 ), so the total energy is 


X 


e~‘ (100-5z)dV = e~‘ fg(100 2 -85 2 ). 
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The time rate of change is then 


d 
d t 


X 


e-' 2 (100-5z)dV 


X 


(~2t)e~‘ (100- 


-5z)dV = (-2r)e _f2 j5(100 2 - 


■85 2 ). 


□ 

Example 6.39. Let p(x,y,t ) be a C 1 function that represents the population 
density [people/area] in a region D of R 2 at each time t. Express the population 
of D at time t and its rate of change with respect to time [people/area/time] 
using integrals. The population at time t is 

P(t) = I p(x,y,t)dA. 

Jd 

Its rate of change is P' it) — ~ f p(x,y,t)dA = f p t (x,y,t)dA. □ 

d tJ d Jd 

Probability density functions. The definition of integrable functions over 
unbounded sets in R 3 can be made analogously to the definitions in Section 6.5. 
We also make the following definition of probability density function p from R 3 to 
R similar to the case in R 2 . 


Definition 6.16. A probability density function in R 3 is a nonnegative function 
that is integrable over R 3 and whose integral over R 3 is equal to 1. 


Example 6.40. We have evaluated 



dx = 7r 1/2 . 


—'xn — 2 _ 2 _ 2 

Set p(x,y,z ) = n ^ /z e * y z > 0. Then p is a probability density function: 


f n- 3/2 e- x2 - y2 -* 2 dV = n- 3 ' 2 lim 

Jr3 


r*n r*n r*n 
J-n J-n J-n 


_ 2 _ 2 _ 2 
e~ z e~ y e~ x dxdydz 


X OO /^OO /~*oo 

e~ z2 dz I e~ y2 dy I e“^ 2 dx = 7r _3/2 7r 1/2 7r 1/2 7r 1/2 = 1. 

oo —oo %J —oo 

□ 

Example 6.41. A function of the form 

f(p , 0, 6) = pe _p cos 0 

is used in chemistry to describe a “2p” orbital of an electron. (See also Prob¬ 
lem 9.48.) In the application, the probability density function for finding the 
electron at point is 


p(p,<f>,6) = c(f(p,<f>,6 )) 2 
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f 

Jr- 


where c is chosen to make I p dV = 1. To find c, we calculate the integral 


f P dv= nr ir< 

Jr 3 J6>=o \ J0=o \ Jp=o 


cp 2 e 2p cos 2 0p 2 sin 


j dcp j d0 

[ -|7T p°° poo 

- i cos 3 0 I p 4 e _2p dp = c(27t) § I p 4 e _2p dp. 

J 0=0 J p =o J P =o 

In Problem 6.63 we ask you to show that the last integral is equal to |, thus 


c = 


Problems 


6.57. Let D = [0 ,a] 5 be the box in R 5 consisting of all points X = (xi,X 2 ,X 3 ,X 4 ,xs) 
where 0 < xj < a for j = 1,2,3,4,5. Evaluate the integrals. 

(a) Vol (D) 

(b) f x\ d 5 X 
Jd 

(c) f (v 2 -X 4 + 7 x 5 x 3 )d 5 X. 

JD 

6.58. Find the average value of the function f(x,y,z ) = x 2 +y 2 -z 2 on the sets 

(a) [-U] 3 

(b) x 2 + y 2 + z 2 < 1 . 

6.59. Evaluate the integral xz 2 dE for the two regions. 

Jd 

(a) the rectangular region D = [1,2] x [3,5] x [-1,10] 

(b) the subset of D = [1,2] x [3,5] x [-1,10] where z > x + y- 

6.60. Let D be the cube where x,y,z each vary from -2 to 2. 

(a) Explain by a symmetry argument why I xz 2 dV = 0. 

Jd 

(b) Evaluate j (8x + 2xz 2 - 4 y 2 z +10) dV. 

Jd 

6.61. A planet is represented by a ball of radius R centered at the origin. Its energy 

density [energy/volume] at (x,y,z) is /(x,y,z) = e - Find the total energy 

of the planet. 

6.62. Verify that the Jacobian for the change from rectangular to spherical coordi¬ 
nates is 

JF(p,(f),6 ) = p 2 sin 0 . 
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6.63. In this problem we evaluate the integral 


r*oo 

| p 4 e -2p dp = 
Jo 


that was used in Example 6.41. 


(a) Verify that L e 2p dp 


f o 


(b) Let k and n be positive. Show that 


X n _~-2 n nn -2p 

p k t~ 2p dp = n k -- kp k ~ l — dp. 


«oo 


(c) Let 4 = I 2 p dp. Show that for k > 1, 4 = ~4-i • 

Jo 2 

(d) Deduce that 4 = |. 

6.64. An integral over the unbounded set R n is the limit of integrals over an increas¬ 
ing sequence of smoothly bounded sets D\ c D 2 c • • • D n c • • • whose union is R n . 
Show by a change of variables that 



6.65. Let p be a positive number and define a function / from R 3 to R by /(X) = 0 
when ||X|| < p , and /(X) = e -||X|1 when ||X|| > p. Evaluate the integral 



using spherical coordinates. 

6.66. Let a \,... ,a n be positive constants. Show by a change of variables that 



6.67. Consider the integral over the ^-dimensional cube 



(a) Evaluate I x 2 d"X as an 

J[o,ip 

has the same value. 


(a) Evaluate 


iterated integral. By symmetry each I x 2 d n X 

J[0,i] w J 
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(c) Find the average value of ||X|| 2 on [0, l] n . 

(d) Find the average value of ||X|| 2 on [0,2] n . 

6.68. Let 0 < a < b and consider the integral 

f (x 2 +y 2 +z 2 f dV 

Jd 

where D is the region of R 3 between the spheres of radius a and b centered at the 
origin. For parts (c) and (d), see Definition 6.9 and use the corresponding concept 
of integration over unbounded sets in R 3 . 

(a) Evaluate the integral for the case q = -3/2. 

(b) Evaluate the integral for other values of q. 

(c) Verify that I (x 2 +y 2 + z 2 ) -2 dV exists and find its value. 

J x 2 +y 2 +z 2 > 1 

(d) Verify that I (x 2 + y 2 + z 2 ) -3/2 dV does not exist. 

J x 2 +y 2 +z 2 > 1 


Chapter 7 

Line and surface integrals 


Abstract We use integrals to find the total amount of some quantity on a curve or 
surface in space. Examples include total mass of a wire, work along a curve, total 
charge on a surface, and flux across a surface. 


7.1 Line integrals 

In single variable calculus we saw that the total mass of a straight wire that lies 
on an interval between x- a and x-b and that has density f(x) [mass/length] is 

r h 

I f(x) dx. We also saw that if a force f(x) varies continuously with position v and 


is in the direction of the motion then the total work done in moving from a to b on 

r b 

the v axis is I f(x) dx. In this section we introduce line integrals to find the total 


mass of a curved wire and the total work along a curve. 

Definition 7.1. Let X be a C 1 function from [a, b] to R 3 , X(f) = (x(t),y(t),z(t)), 
with X'(t) A 0 on [a, b] . The range of X, C, is called a smooth curve from X(a) 
to X(b). X is called a smooth parametrization of C. The length of the curve, 
or arclength is 



*b 
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A piecewise smooth curve is a finite union of smooth curves Ci,...,C m each 
joined continuously, but perhaps not differentiably, to the next. The length of a piece- 
wise smooth curve is the sum of the lengths of the smooth parts. 

Next we compute some arc lengths. 

Example 7.1. The curve C\ given by 

Xi(t) = (t,2t,t+1), 0<t<\ 


is a line segment from (0,0,1) to (1,2,2). The length of C\ is the distance 
between the endpoints, 

a /(1-0) 2 + (2-0) 2 + (2-1) 2 = V6. 

Using the definition for the length of C\ we find 

x\t) = (O' = 1, y\t) = (21)' = 2, z'(t) = (t+ 1)' = 1, 

and we get f ds = f s/l 2 + 2 2 + l 2 dt = C V6d £ = V6. □ 

Jc i Jo Jo 

Example 7.2. Find the length of the curve C 2 given by 

X 2 (m) = (e M - l,2e M -2,e M ), 0 < u < log2. 

Since X' 2 (u) = (e M ,2e M ,e M ) we get 

J r- ri og2 I - r*log2 

= I J (e u ) 2 + (2e u ) 2 + (e u ) 2 du = I V6e M dw = V6(e log2 -e°) = V6. 

c 2 Jo y Jo 


Note that C\ in Example 7.1 and C 2 in Example 7.2 are the same curve, 
parametrized differently. Let 

0<w<log2, 0<r< 1, t - e l> - 1. 

Then Xi and X 2 are related by X 2 (u) = X\(t(u)). By the Chain Rule 

X' 2 (u) = X[(t(u))t\u), 

and by the change of variables theorem for integrals 


r-t= 1 log 2 /^M=iogz 

l|X , 1 (0Hdf= \\X' l (t(u))\\\t'(u)\du= \\X' 2 (u)\\du. 

Jt-0 Ju-0 Ju-0 

We show by the same argument that the definition of the length of a curve is 
independent of the parametrization of the curve. For if t- t(u ), a<u<(3 is another 
. dt . . 

parameter with — positive, then 
du 


>u =log 2 
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d X(t(u)) _ dX d t_ 
d t d u 


d u 


and so by the change of variables formula 


Length(C) = 


f 


• fo "dXi 

d t I 


d t - 


dX || d t 
d t II du 


■ du ■ 


f 


'P ii dX dn 
d t du\ 


du - 


dX(t(u)) | 

du 


du. 


Two different interpretations of the integral of ||X'(0II- We saw in Chapter 5 that 
if 


X(0 = (x(t),y(t),z(t)) 

is the position of a particle at time t then X'(t) is its velocity and ||X'(t)|| is its speed. 
Then the integral of the speed over the interval from t = a to t = b is the total distance 
or length of the path traveled from t = a to t = b. 



Fig. 7.1 A parametrization stretches or shrinks subintervals of the domain. 


Alternatively, consider the interval [< a , b ] on the number line and imagine that it 
is stretched like a rubber band onto a curve in space. See Figure 7.1. The part of the 
band that was originally between t and t +At is now on the curve between X(t) and 
X(t-\-At). Originally the length of this part of the band was At. Now that part has 
length roughly the distance between X(t) and X(t+At), or 

||X0+zh)-X(0l|. 

By the Mean Value Theorem that is approximately 

HX'COIIkhl. 

That is, HX'COII is the factor by which the interval from t to t+At was stretched when 
it was mapped to the part of the curve between X(t) and X(t + At). The sum of the 
lengths \\X\t)\\At of the pieces tends to the length of the curve. 

Arc length parametrization. Denote by C a smooth curve of length L. There are 
many smooth parametrizations for C. There is one called the arc length parametriza¬ 
tion that is natural in the sense that it maps the interval [0,L] one to one onto C 
without stretching or shrinking the interval at any point in the process. That is, the 
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derivative —-— at each s in [0,L] is a unit vector, so that the stretching factor, 
as 

||X'(0IU is 1. In Problem 7.3 we show you an example of such a parametrization. 
To see that there is such a parametrization of a smooth curve, let X be a smooth 
parametrization of C with domain [a, b] and define s(t ) to be the length of the curve 
from X(a) to X(t): 

S(t)= f ||X'(T)||dT. 

Ja 

By the Fundamental Theorem of Calculus s'(t) = ||X'(Oil- Since X is smooth, ||X'(0ll 
is not zero, so s'(t ) is positive. Therefore s is increasing and invertible, and we can 
write 1 as a function of s, t = t(s). By the Inverse Function Theorem for single 
variable this inverse function is differentiable and 

d t _ 1 

d~s~ ||X'( 0 lf 

We define the arc length parametrization to be X(t(s)). By the Chain Rule, 


d(X(>0))) _ dX dt _ X\t) 
ds “ ~dtd~s ~ ||X'(0ir 


which is a unit vector. In terms of speed, the arc length parametrization has speed 1 
at each point of the curve. 

We have shown that every smooth curve C has an arc length parametrization. It 
is useful to know that it exists. The next example shows an unusual case where there 
is a simple formula for it. 

Example 7.3. Let C be the helix given by the parametrization 


0 < t < 2n, a 2 + b 2 ± 0. 


X(t) = (acost,asint,bt), 

Then ||X'(0ll = Va 2 + b 2 for each t. If a 2 + b 2 = 1 then X is the arc length 


parametrization of the helix. If not, then s - 


■ f ||X'(T)||dT= A /a 2 + b 2 t , and 

Jo 


X(t(s)) = ^acos(-j==),asin(-j==),b-j=='j, 0<s<2nSa 


+ b 2 , 


is the arc length parametrization. □ 

Line integral To motivate the definition of the line integral consider a wire with 
continuous density f(x,y,z) [mass/length] and assume the wire lies along a smooth 
curve C parametrized by 


X(0 = 


a<t<b. 















7.1 Line integrals 


283 


Partition the curve into short segments C* . By the continuity of /, if the segments 
are short the density doesn’t vary much in each segment. Estimate the mass m* of 
the z-th segment by using the density /(X(^)) at one end of the segment times the 
length of the segment 

m i ~ /(X(^))Length(Q). 

The sum of these estimates is an estimate for the total mass of the wire. (See Fig¬ 
ure 7.2) The length of Q is close to the length of the secant, ||X(^) - X(^_i)||. 
By the Mean Value Theorem and continuity of X', ||X(^-) - X(^-_i|| is close to 
||X'(^)II \U ~ U-\ I- So the mass of the wire is approximately the sum 

* E/( x fe))ii x v,)ii \u - u- ii. 

i i 

The sum on the right tends to the integral 

/(X(O)IIX'(Olldr 

as the subinterval lengths ti - U-\ tend to zero. This motivates the definition of the 
line integral of / over C with respect to arc length. 



Definition 7.2. Let / be a continuous function on a smooth curve C 
parametrized by X(t) = (. x(t),y(t),z(t )), a < t < b. The integral of f over C 
with respect to arc length is 

r fds= f b /(x^iix'wiidL 

Jc Ja 

For a piecewise smooth curve C, we define the line integral to be the sum of 
line integrals over the smooth pieces. 


Just as we saw with the length of C, the integral of a continuous function / over 
a smooth curve is independent of the parametrization. 

Example 7.4. A wire lies along a helical curve C given by 
X(0 = (cos t, sin t,t), 0 < t < 4n 

and its density is f(x,y,z ) = z [mass/length]. Find the mass of the wire. We 
integrate / along C. 



/(x(o) mmt. 

mass/length length 


X'(0 = (-sin^cosL 1) and /(X(f)) = t. The total mass is 
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X-JN 


(- sin t) 2 + (cos t) 2 + l 2 dt- 


r 


tClAt=% 2 


An 


8n 2 V2. 


Example 7.5. The curve C given in Example 7.4 can also be parametrized by 

Xi(r) = (cos(4r), sin(4r),4r), 0 < r < n. 

Using this parametrization for C we get that the integral of f(x,y,z ) = z over 
C is 

f fds = r/(Xi(r))||X , 1 (r)||dr= T(4t)(4 V2)dr = 8 V2^ 2 . 

Jc Jo Jo 


Definition 7.3. Let / be a continuous function on a smooth curve C 
parametrized by X(t) = (. x(t),y(f),z(t )), a < t < b. The average of / over C 
is 

X c /ds /(X(0)||X'(Q||dr 

fc ds ll x '(0lldr 


Example 7.6. In Example 7.4 we saw that the density f(x,y,z ) varied at points 
along C. To find the average density on C we divide the mass of the wire, 


X 


fds , by its length, I ds. We compute 


•X- 

/-» p47T r*47T 

Length(C) = I d.v= I ||X'(r)||df= ZlAt = AnZ2. 
Jc Jo Jo 
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The average density is 


fcf^ s _ 8?r 2 V2 

f c ds 4ttV2 


= In [mass/length]. 


Work along a curve The work done by a constant force F in moving an object 
along a straight line from point A to B is 


F-(B-A). 


The work done by a variable force F in moving an object along a smooth curve C 
from A to B motivates finding the integral of the tangential component of F over C, 
called the integral of F along C. 


Definition 7.4. Let F be a continuous vector field on a smooth curve C given 
by X(t) = (. x(t),y(t),z(t )), a<t<b. The unit tangent vector to C in the direction 
of increasing t is 

T _ X'(r) 

iixwir 

and T orients C from A = X(a) to B = X(b). See Figure 7.3. The integral of F 
along C (in the tangential direction) from A = X(a) to B = X(b) is 


f F • Tdv = pF(X(0)-T(X(0)||X'(J)||df= f* F(X(t))-X' 
JC Ja Ja 


(t)dt. 



Fig. 7.3 The unit tangent vector T at the point X(/). 


If C is piecewise smooth we define the integral of F along C to be the sum of the 
integrals of F along the smooth pieces. 

In Figure 7.4 a curve C is traversed from A to B. If C is traversed in the opposite 
direction from B to A, we denote that curve by -C. Denote the unit tangents as Ti 
on C and as T 2 on -C. The directions of Ti and T 2 at each point are opposite, so 
F • T] = —F • T 2 , and 

L FT2ds= -fc F-Tids. 

Example 7.7. Find the work done by F(v,y,z) = (y, -x,z) in moving from 
(1,0,0) to (1,0,2 n) along the helical curve C given by X(t) = (cost,sint,t), 
0 < t < 2n. At the point X(0 on C, 
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Fig. 7.4 Reversing the direction in which the curve is traversed reverses the direction of the unit 
tangents, and F Ti = -F • T 2 . 


F(X(/)) = (sin/,-cos/,r), X'(f) = (-sinf.cosf, 1), ||X'(/)|| = V2 


, (-sin/,cos/, 1) . 

and T(X(r)) =---. The work is 


V2 


f F • Tds = f 
Jc Jo 

■r 


(sin L- cos t, t) 


(-shU, cosL 1) 

v! 


V2d/ 


(/- l)d/ = 


\t 2 -t 


In 


= 27r - 2n. 


Jo 


Example 7.8. Find the work done by F(v,y,z) = (y,-x,z) in moving from 
(1,0, 27 t) to (1,0,0) along a straight curve C 2 . We parametrize C 2 by 


X(t) = (1,0, 2tt) + £(0,0, -2tt) = ( 1,0, -2nt + 2 tt\ 0<t<l. 


F (X(0) = (0,-l,-27tf + 27r), X'(0 = (0,0,-2 tt), 

||X'(0ll = 2 tt, T(X(0) = (0,0,-1). 

The work done is 


f F -Tds 

Jc 2 


f 


F(X(/))-T(X(/))||X'(/)||d/ = F(X(/)) • X'(/)d/ 


-/■ 


= 0,-1,- 

Jo 


-27it + 27 t) • (0,0, -27t) d£ 


=/' 


47r 2 (/-l)d/ = 2^ 2 (/-l) 2 


= -2s- 2 . 


There are many ways to parametrize the segment in Example 7.8. In Prob¬ 
lem 7.12 we ask you to create two different parametrizations and verify that you 
get the same answer for the work. 


Example 7.9. Find the work done by F(v,y,z) = (y, -x,z) in moving an object 
from (1,0,0) to (1,0,0) along the curve C = C\ U C 2 , where C\ is the helical 
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Fig. 7.5 Work done by F(x,y,z) = (y, —x,z) along the loop C\ U C 2 in Example 7.9 is not zero. 


curve from (1,0,0) to (1,0, 2n) in Example 7.7 and C 2 is the straight curve 
from (1,0, 2n) to (1,0,0) in Example 7.8. The work done is the sum of the 
work along the smooth curves, 


f F • Tds = f 
Jc Jc 


F-Tds 


C1UC2 


= f F-Tds+ f F • Tds = In 2 -In- 2n 2 = -2n. 

Jc 1 Jc 2 


Note that the work by the force F(x,y,z) = (y,-x,z) in moving an object 
around the loop C\ U C 2 is not zero. □ 


Example 7.10. Let F(v,y) = ( ^ X ) and let Cr be a circle of radius 

v * 2 +y 2 x 2 +y 2 ' 

R centered at the origin traversed in the counterclockwise direction. One way 


to find 


f F-Tds 

Jc R 


is to parametrize Cr. Another way is to use the special geometric relationship 
between this vector field at P, F(P), and the tangent to Cr at P, T(P). See 
Figure 7.6. We observe that 


F(v,y) • (v,y) = 0 and T(v,y) • (v,y) = 0 


and that F is a multiple of T at each point on Cr. So the angle between F 
and T is zero and the tangential component of F, F • T = ||F||||T|| cos0, is equal 

to the magnitude of F. The magnitude of F(v,y) is ||F(v,y)|| = . the 

V^+y 2 

reciprocal of the distance to the origin, which on Cr is l/R. Therefore 


/ F ' Td * = / 1 

Jc R Jc r K- 


ds = iLength(Cs) = ^ 

K K 


■ 2n. 


□ 


In Problem 7.6 we ask you to redo Example 7.10 by parametrizing Cr. 
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If we denote 

then 


Fig. 7.6 In Example 7.10, T(P) and F(P) have the same direction at each point P of the circle. 

Evaluation of line integrals For a piecewise smooth curve C parametrized by 
X(t) = (. x(t),y(t),z(t )), a<t <b, 

the integral of the tangential component of F = (/i,/ 2 ,/ 3 ) over C is 

J F-Td s = £ F(X(t))-X'(t)dt. 

Tds = (dx,dy,dz) 

fc FTdS -fc f 1 (x,y,z)dx + f 2 (x,y,z)dy + / 3 (x, y,z)dz 

= J fi (x,y, z)dx+J^ f 2 (x,y, z)dy + J^ f 2 (x,y, z) dz. 

In terms of the parametrization X we then have 

f f\{x,y,z)dx = f /i (X(1))x (t)dl, 

Jc Ja 

f f 2 (x,y,z)dy= f* f 2 {X(t))y'(t)dt, 

JC Ja 

J^f 3 (x,y,z)dz = J fo{X(t))z! 


(t)dt. 


The benefit of this approach is that we may use different parametrizations of C for 
evaluating the integrals of different components of F. Let’s look at an example. 

Example 7.11. Find the integral of 

F(x,y) = (x 2 + 3,2) 
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4 _ 


2 - 


Fig. 7.7 The curve in Example 7.11. 


along the curve C shown in Figure 7.7, from (1,4) to (3,4). One way to 
approach this problem is to parametrize the three segments C\,C 2 ,Cs and 
find the sum of the line integrals. Let’s look at the integrals of the component 
functions over C and see whether it helps simplify the problem. 


X FTds= X 


(x 2 + 3)dv + 2dy = J* (v 2 + 3)dv + J* 2dy 


Jc Jc 

Consider the integral of the first component 


J* (x 2 + 3)dv = J* (v 2 + 3)dv+ | (v 2 + 3)dv+ | (v 2 + 3)dv. 


r (v 2 +3)dv+ r 

Jc 7 Jc 

X 


On Ci, x'(t) = 0 because there is no change in v. So I (x + 3)dx = 0 and 

Jci 

(v 2 + 3)dx = 0 for the same reason. The curve C 2 is parallel to the v axis 


l: 


so we can take X(t) = (t,2), 1 < t < 3. Then x = t, dx = dt and 
-3 ^3 r .3 


r (v 2 + 3) dx = r ( t 2 + 3) d t = r (v 2 + 3)dv = 
Jc 2 J*=i Jx= 1 


^x 3 + 3v 


- 26 6 _ 44 

“ 3 + ° “ 3 • 


Similarly considering the second component function 


f 2 dy = 2 f f dy + f dy+ f dy) = 2(2-4) + 0 + 2(4-2) = 0, 
Jc \JC\ Jc 7 Jc 3 ) 

>X r 


Td.?= y. □ 

Application of line integrals to circulation The velocity of a fluid flow in the 
plane, at a moment in time, may be represented as a vector field 


U(v,y) = (w(*,y),v(*,y)). 
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The functions u and v are the v and y components of the velocity at (v,y). 

Example 7.12. (i) The velocity field U(v,y) = (y,0) represents a flow parallel 
to the v axis, that depends only on y. The region 0<y< 1 models the flow of 
a river of depth 1. See Figure 7.8. 

(ii) The velocity field U(v,y) = (-y,x) represents a flow that rotates coun¬ 
terclockwise around the origin, with speed ||U(v,y)|| = ^x 2 +y 2 equal to the 
distance to the origin. □ 



I. • LL-: : : ; 


Fig. 7.8 Sketches of vector fields from Example 7.12. Left: U(jt,y) = (y,0). Right: U(jt,y) = (-y, jc). 
In contexts where U is the velocity field of a fluid flow, the line integral 



is called the circulation of U along C. Of particular interest is circulation along a 
curve that is a loop. Let’s look at some examples. 



u 


Fig. 7.9 The circulation of U along C is negative. 

Example 7.13. The information about the vector field U and the curve C in 
Figure 7.9 indicates that U • T < 0 at every point of C. So the circulation of U 
along C is negative, 



If we traverse C in the opposite direction, then the circulation of U along the 
resulting curve -C is positive, 
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U-Tds > 0. 


□ 

Example 7.14. Let C be a circle oriented clockwise as shown in Figure 7.10. 
Unit tangent vectors T are sketched lightly. The velocity vector field 

uu>-) = (y, 0) 


is sketched using darker vectors. Is the circulation of U along C, 

J U-Tds 

positive, negative, or zero? Let P and Q be two diametrically opposite points on 
C, with P on the upper half of C, C \, and Q on the lower half C 2 . Unit tangent 
vectors T(P) and T(Q) point in opposite directions, and velocity vectors U(P) 
and U(Q) only differ in length, with U(P) longer than U(Q). We see that 

U(P) • T(P) > 0, U(Q) • T(Q) < 0, 


and U(P) • T(P) > U(Q) • T(P) = U(Q) • (- T(Q)). Therefore 


r u*Td^>- r u 

JC 1 JCo 


Tds 


)c x Jc 2 

and the total circulation along C is positive: 


r u-Tds = r u • Tds + r 

Jc JC 1 JC 


U-Tds+ U-Tds>0. 

Ci Jc 2 


□ 



In Chapter 8 we will see that the circulation of a vector field around a small closed 
loop is related to the local rotation, called the curl of the velocity field, curlU(P), 
that we defined in Section 3.5. 
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Example 7.15. Let U(x,y) = 




, a constant. Find the circulation of U 


(x 2 +y 2 ) a ’ 

along a circle C of radius r centered at the origin, and oriented counterclock¬ 
wise. 

At each point ( x,y ) of C, the vector ( -y 9 x ) is orthogonal to (x,y), and its 
direction agrees with the unit tangent to C. Its norm is ||(-y,x)|| = r. There- 

l y2 +x 2 i 

fore the unit tangent vector is T = -(-y,x). So U • T = ——— =- 

y yZCl~\- 1 

circulation is 


r 2a-l 


. The 


f U-Tds=A- j ds=-d-27r r = 

Jc r 2a-l J c r 2a-\ 


2 n 

~2a-2 ' 


□ 


Example 7.16. Note the special case a = 1 in Example 7.15. The circulation 
of 


U (x,y) = 


(-y,x) 

x 2 +y 2 


along circles x 2 + y 2 = r 2 is 2n, independent of the radius r. As we will see in 
Section 8.1, along circles C that do not enclose the origin, the circulation of 
U is zero. In Problem 7.17 we outline a geometric reason for this fact. □ 
Flux across a curve in R 2 . The integral of the tangential component F • T of a 
vector field F along an oriented curve C can be interpreted as work or as circulation 
along the curve. The integral of the component of F normal to C is called the flux of 
F across C. 

Let C be a smooth curve in the plane, parametrized by X(t) = (x(t),y(t)) with 
X'(t) = ( x'(t),y'(t )) ^ 0. There are two unit normal vectors to C at (x(t),y(t)), 


_ (~y'(t),x'(tj) _ (/(0,-x'(0) 

l|X'(f)|| ’ " l|X'(0ll 

They are orthogonal to X'(t) and point in opposite directions. 


Definition 7.5. Let C be a smooth curve parametrized by X(t) = (x(t),y(t)), 
a<t <b , and let F = (/i,/ 2 ) be a continuous vector field on C. The flux of F 
across C in the direction of the normal 


N = N(x(0,y(0) = 


(-y (0.^(0) 

l|X'(0ll 


IS 


X F 


Nd^ 


= f b F(X(t)) 

Ja 


(-y (0.^(0) 

IIX'WII 


||X'(0l|d/ = JT- 


-fidy + fidx. 


Example 7.17. Find the flux of F(x,y) = (x,y) outward across the circle C of 
radius 3 centered at the origin. C can be parametrized by 
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X(f) = (3cosf,3sinf), 0<t<2n. 

Then X'(t) = (- 3 sinL 3 cos t) and T(T) = (- sinL cos t). The two unit normals 
are 

N(0 = (cos t, sin t), -N (t) = -(cos t, sin t). 

N points outward at each point on C. The flux outward across C is 

f F-Nds = f (3cosf,3sinf)-(cosf,sinOI|X'(OI|df 
Jc Jo 


r*2n 

-1 : 


3(cos 2 ^ + sin 2 03dr = 9(27 t) = 187T. 


An alternative way to compute the integral is to observe that at each point 
on C, F and N are parallel. So F-N = ||F||||N||cos0 = ||F||. This is equal to 
^x 2 +y 2 - 3 since (x,y) is on C. Therefore 

J F -Nds = J 3ds = 3Length(C) = 3(2 tt(3)) = 1 8n. 


Line integrals in R n . Analogous to Definition 7.1 we define a smooth curve C in 
R n to be the range of a C 1 function X(t) = (xi (t ),..., x n (t)), a<t<b , with X'(t) ^ 0. 
X is called a smooth parametrization of C. The length of C is defined as 


Length(C) 


- f IX- 

Ja 


(OlldL 


We note that the length of a curve as defined above is independent of the parametriza- 

d t 

tion of the curve. For if t = t(u ), a <u <(3 is another parameter, with — positive, 

d u 

then 

dX(fi») _ dX d t 
d u d t d u 

and so by the change of variables formula 


t riidXi. fnidXii dt 

Len8 “ ,(C)= l Ul d '=l bb 

'P\\dX(t(u)) 


d u 


dX dt || , 

~i~T~\\ du 
dt di/N 


-f 


du 


du. 


Given a function / from R n to R that is continuous on C, the line integral of f 
over C is 


£fdS = £ /(X(O)IIX' 
In particular taking f(X) - 1 we get 


(Oil dr. 
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Length(C) = ds = fax' 
We define the average of / on C as 


(r)|| dr. 


MO £ 


Length(C) 


fds. 


Example 7.18. Show that the average of the first coordinates of the points on 
the line segment from A = (a \,..., a n ) to B = (b \,..., b n ) in R n is 

\i a \ +b\). 

Let X(0 = A + r(B - A), 0 < t < 1 parametrize the segment from A to B and let 
f{x i,..., x n ) = x\. The average of / over the segment is 

+tjbi -fli))||B-A||df 
Jq 1 l|X'(0lldr f 0 ‘ ||B — A||d/ 

l|B — A|| C{a\+t(b\-a\))dt 
= -- = 2(01 +fol) - 

In Problem 7.5 we ask you to show this is true for each coordinate. □ 

Let F be a vector field 


F(xi,...,x„) = (/i(xi,...,x„),...,/„(xi,...,x„)) 


that is continuous on a smooth curve C parametrized by X, 


X(0 = (*1 (0, • • •, X n (t)), a<t <b. 


We denote the unit tangent in the direction of increasing t as T(X(7)) = 
in R 3 the line integral of the tangential component of F along C is 


As 

l|X'(r)ir 


f F-Tds = f b F(X(t))- 
JC Ja 


X'(t) 

IIX'WII 


llX'COIIdf: 


: f F (X(O).X' 
Ja 


{t) dt. 


Example 7.19. Define F(X) = X in R n and let C be a smooth curve from a 
point A = Xia), to B = X(b). Then 


J F(X)-Tds = £ X(t)-X'(t)dt 

= f (x l (t)x[(t) + X2(t)x' 2 (t) + --- + x n (t)x' n (t))dt 
Ja 








7.1 Line integrals 


295 



i(||B|| 2 -||A|| 2 ). 


□ 


Problems 

7.1. Let C be a line segment in the x,y plane with endpoints A and B. Consider 
two parametrizations of C: first X(t) = A + t(B - A) for 0 < t < 1, and second 
Y{u) = B + ^u(A - B) for 0 < u < 2. Calculate the following integrals using both 
parametrizations. 

(a) J" d s, 

(b) J' yds, 

(c) The average of y on C. 



Fig. 7.11 The curves C\ and C 2 in Problem 7.2 are line segments. 

7.2. Let Ci and C 2 be the segments in Figure 7.11, and denote by Ci U C 2 the com¬ 
bined curve consisting of Ci followed by C 2 . Find the value of the line integrals by 
using their geometric meaning. 


(a) 

f ds 

JCi 

(b) 

f ds 

Jc 2 

(c) 

f ds 


jC\ UC2 


7.3. Let X(7) = A + t(B - A), 0 < t < 1, parametrize the line segment C from A 
to B. Find a number k so that the substitution t = ks gives a new parametrization 
YO) = X(ks), with 

Y(0) = A, 0 < j < Length(C), Y(Length(C)) = B, and ||Y'0)|| = 1. 

Y(s) is the arc length parametrization of C. 
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7.4. Let C be the smooth curve in R 4 given by X : [0, 2n] —> R 4 , 
X(t) = (cos t, sin t, cos kt, sin kt) 


and k is a constant. 

(a) Find X'(t) and the unit tangent vector T at X(t). 

(b) Find the length of C. 

(c) Verify that for each number k the curve C lies on the sphere 

x 2 + + X 2 + X ^ — 2 . 

(d) Let f(x\, X2, X3, X4) = x\ and g(x\, X2, X3, X4) = X4. Show that the average of / on 
C is 0, but the average of g is not zero unless k is an integer. 


7.5. Show that the average of the i -th component xi of the points on a line segment 
from A = (a \,..., a n ) to B = (b \,..., b n ), in any number of dimensions, is 

2 


7.6. Let F(v,y) 


x 2 +y 2 


, and let Cr be the circle of radius R centered at the 


origin and traversed in the counterclockwise direction. Use the parametrization 
X(t) = (Rcost,Rsint), 0 < t < In to calculate 


f F • Tds. 

Jc R 


7.7. Let C be the triangle with vertices (0,0), (1,0), and (1,1), traversed counter¬ 
clockwise. Evaluate the line integral 


J* y 2 dx + xdy. 


7.8. Let F = ( p , q ) be a constant vector field, and C a line segment in the plane. 
Choose one of the two unit normal vectors N for C. Show that the flux of F across 
C is given by 

J F ■ Nd.y = F • NLength(C). 

7.9. A smooth curve C has unit tangent vectors T = (£ 1 ,^ 2 )- Take the unit normal to 
C to be N = (t 2 ,-ti) at each point. Let F = (/i,/ 2 ) be a vector field, and define a 
vector field G = (/ 2 , -f\) at each point. Show that 


£ FTds -£ 


G-Nds. 


7.10. Let C be the line segment from (0,0) to (4,3), parametrized by 
(x(t),y(t)) = (p, |t), (0 < f < 5) 
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and let C a be the circle parametrized by 


X(0 = {a cos t, a sin t ), 0 < t < 2n, a> 0. 


Let F = (8,0), a constant vector field. 

(a) Find the normal vector N (x(t),y(t)) to C, that is obtained by rotating the unit 
tangent vector T ninety degrees clockwise. 

(b) Find the flux of F across C. 

(c) Find the outward pointing unit normal vector N a at each point of C a . 

(d) Find the flux of F outward across C a . 

7.11. Let a smooth curve C be the graph of y = f(x ) on [a,b]. Use the parametriza- 
tion 


X(0 = (t,f(t)), a<t <b 


to show that the length of C is given by 



Verify that it gives the right answer for y = 3x, 0 < x < 1. 

7.12. Find the work done by F(v,y,z) = (y, -x,z) in moving from (1,0,27r) to (1,0,0) 
along a straight segment C. Create two smooth parametrizations for C and verify 
that you get the same total work with each one. 

7.13. Let Ci be the unit circle centered at the origin and C 2 be the boundary of the 
2 by 2 square centered at the origin of R 2 . Use symmetry arguments and properties 
of integrals to show without calculation: 



7.14. Let C be the piecewise smooth closed curve shown in Figure 7.12 consisting 
of graphs, Ci and C 2 , of two functions g\ and g 2 , traversed clockwise. Let F(x,y) = 
(y,0). The work done by F is 



(a) Write a parametrization for C 2 in terms of g 2 , and show that 


f F • Tds = f g 2 (x)dx. 

C'i a 
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Fig. 7.12 The curves in Problem 7.14. 


(b) Write a parametrization for C\ in terms of gu and show that 


f F • Tds = - f 

JC i Ja 


8 i(x)dx. 


(c) Show that the area between the graphs of gi and g 2 is J" vd.v. 




Fig. 7.13 The curves in Problem 7.15. 

7.15. A container ship is driven by a large diesel engine. The engineer has measured 
the pressure p in the cylinders at each piston position x. See Figure 7.13. The curve 
marked K is while the piston moves upward to compress the air, and the curve 
marked E is while the burning fuel expands to force the piston down. The work 
[Joules] done in one cycle is the line integral 

I pdx. 

Jkue 

(a) Do I pdx and I pdx have the same, or opposite signs? Why? 

Jk Je 

























7.1 Line integrals 


299 


(b) Use the results from Problem 7.14 to show that the work done in one cycle is the 
area between the graphs. 

7.16. Find the circulation of the vector fields 

(a) = (-3y,2x) 

(b) \(x,y) = -5U (x,y) = (15y,-10x) 

counterclockwise around the circle of radius R , x 2 +y 2 = R 2 . 



Fig. 7.14 Figure for Problem 7.17. 


7.17. Justify the following items to show that the circulation of 


U(x,y) = 


(~y,x) 
x 2 +y 2 


around any circle that does not enclose the origin is zero. See Figure 7.14, where W 
denotes unit vectors orthogonal to each radial line. 


(a) The integral 


X 1 


U-Td j 


can be well approximated by a sum over pairs of segments As i, As 2 using small 
angles A6 as shown, where r\ and r 2 are distances from the origin to the segments. 

(b) U = ±W 

(c) AG w -2 w-(Tiz)^i) w 2w-(T 2 zJs 2 ) 

n r 2 


(d) 


J' U • Td.v« ^ |L W • (T 2 As 2 ) + — W • (Tizl.vi )j is approximately zero. 


7.18. Let F(x,y) = Vlogr, where r = yj'x 2 +y 2 , and let C be the circle x 2 +y 2 = a 2 . 
Find the flux of F outward across C, and show that it is independent of the radius a. 


7.19. Evaluate the following line integrals along C, the line segment from A = (0,0) 
to B = (3,4). Which ones are equal to the expressions on the right hand sides? 
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(a) 

(b) 

(c) 


J* xdx =? ^x 2 ] 

X 
X 


x&y =? [x>’]a 
dx + 5dy =? [x + 5y] 


B 

A 


7.2 Conservative vector fields 

Some vector fields have the property that if you integrate along any piecewise 
smooth closed curve , X(t), a<t<b where X(a) = X(b ), then the integral is zero. 
We saw in Example 7.9 that not all vector fields have this property. In this section 
we develop some criteria for determining which fields do have it. 


Definition 7.6. We say that a vector field F is conservative if it is the gradient 
of a continuously differentiable function g, 

F(P) = Vg(P) 

for all P in the domain of F. The function g is called a potential function of F. 


Example 7.20. Let g(x,y,z) = ~(x 2 + y 2 +z 2 ) ^ 2 = — 77^77 * ^ ave seen 

l|X|| 


Vg(x,y,z) 


(x,y,z) 

(x 2 +y 2 + z 2 ) 3 / 2 


X 


1 X 


is the inverse square vector field. This shows that the inverse square vector 
field is conservative and that isa potential function. □ 


Theorem 7.1. Suppose F is a continuous vector field from an open connected 
set D in R n to R n , n> 2. Then the following three statements are equivalent, 
that is, each implies the others. 

(a) ¥ is conservative, 

(b) For every piecewise smooth closed curve C in D, J F • Tds = 0. 

(c) For any two points A and B in D, and for any two piecewise smooth curves 
C i and C 2 in D that start at A and end at B 


f F-Tdj = 

Jc x 


f F Tds 

Jc 2 
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Proof. We show that (a) implies (b), (b) implies (c), and (c) implies (a). From that 
it follows that each statement implies both of the others, and so if one is false the 
others must be false too. We write out the case for a vector field from R 3 to R 3 . 
Suppose (a) holds. Then there is a function g such that 


F (x,y,z) = (, g x (x,y,z),g y (x,y,z),g z (x,y,z )). 


For a smooth curve C in D given by X(t), a<t <b , 

n r*b r*b 

F -Tds = I F(X(r)) ■ X'(t)dt = I Vg(X(0)-X' 

JC Ja Ja 




By the Chain Rule for curves, Vg(X(0) ■X'(f) = —g(X(t)). Therefore 


JV Td s = £ Vg(X(t)) ■ X'(t) dr = £ £(X(t)) dr = g(X(b)) -g(X(a)). (7.1) 

If C is closed then X(Z?) = X(a) and g(X(b)) - g(X(a)) is zero. If C is piecewise 
smooth, say one smooth piece from X(a) to X(c) and a second smooth piece from 
X(c) to X(b) = X(a), then we evaluate the integral over the smooth pieces, getting 

g(X(c)) - g(X(a)) + g(X(b)) - g(X(c)) = -g(X(a)) + g(X(b)) = 0. 


Similarly for any number of smooth pieces. This proves that (a) implies (b). 

Suppose (b) holds. Let A and B be two points in D. Let C\ and C 2 be any two 
piecewise smooth curves in D that start at A and end at B, and let C = C\ U (-C 2 ). 
Then C is a piecewise smooth closed curve, and by (b) 

J F-Tds = 0. 


Using properties of line integrals we have 



This proves that (b) implies (c). 

Suppose (c) is true. We state without proof that open connected sets in R 3 have 
the property, that every two points in the set can be joined by a piecewise smooth 
curve in the set. Now let A be a point of D. Because D is open and connected, for 
each point (v,y,z) in D there is a piecewise smooth curve C in D from A to (v,y,z). 
Define a function g by 
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Fig. 7.15 A curve from A to (x,y,z) that ends parallel to the x axis. 


g(x,y,z) = J* F • Tds, (x,y,z) in D, C any curve in D from A to (x,y,z). 

We will show that Vg = F. Since (c) is true, any curve we choose from A to (x,y,z) 
will result in the same number g(x,y,z). Since D is open and connected there is a 
point (c,y,z) in D with c < x, so that the straight line C\ from (c,y,z) to (x,y,z) is 
in D and can be combined with a curve C 2 in D from A to (c,y,z). See Figure 7.15. 
Using these curves, 

g(x,y,z) = f F• Tds = f F• Tds+ f F-Td^. 

Jc=ciuc 2 Jc 1 Jc 2 


Parametrize C\ by Xi(0 = (t,y,z), c <t < x. Writing components of F = (/i,/ 2 ,/ 3 ) 
we get 


J F Td s = £ F(t,y,z)■ (l,0,0)df 
The derivative of g with respect to v is 



f\(t,y,z)dt. 


g x (x,y,z)= ^-( f fi(t,y,z)dt+ f F-Td^ 

OX \J C Jc 2 

The second integral does not depend on x so its derivative with respect to x is zero. 
By the Fundamental Theorem of Calculus, 


gx(x,y,z) 


d 

dx 



,y,z)dt = fi(x,y,z). 


Similar arguments show that g y = fa and g z = fa . So Vg = F, and F is conservative. 
This concludes the proof that (c) implies (a), and is the last implication in our chain. 

□ 


In the proof of Theorem 7.1 we showed that if F = Vg then the integral of F 
along every piecewise smooth curve from A to B is simply g(B) - g(A). (See equa¬ 
tion (7.1).) This is often called the Fundamental Theorem of Line Integrals. 
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Theorem 7.2. Fundamental Theorem of Line Integrals. If C is a piece- 
wise smooth curve from A to B in the domain of a continuously differentiable 
function g, then 


X 


Vg • Tds = g(B) - g(A). 


We say the integral of a conservative vector field is independent of path since 
g( B) - g( A) depends only on the values of the potential function at the endpoints of 
the curve. 

Knowing that a vector field is conservative is helpful in evaluating line integrals. 



Fig. 7.16 The curve in Example 7.21. 

Example 7.21. Find the integral of F(x,y,z) = (x,y,z) along the curve C shown 
in Figure 7.16. By inspection we see that F is conservative because 

F (x,y,z) = Vg(x,y,z) = V(±(x 2 +j 2 + z 2 )). 

By the Fundamental Theorem of Line Integrals, 

J^F-Tds = g(l,0,0)-g(l,0,l)= 5 - 1 = -j. 

□ 

Example 7.22. Find the integral of F(x,y,z) = (x 2 +y,siny,z) along the curve 
C shown in Figure 7.17. We find by inspection that 

vQx 3 - cosy + = (v 2 ,siny,z). 

We can write the integral as a sum, 

J* F-Td.s'^ J* ((x 2 +y)dv + sinydy+ zdz) 
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U 


x 1 dx + sinydy + z 


+ x 


ydx. 


Jc Jc 

The first integral can be evaluated by the Fundamental Theorem of Line Inte¬ 
grals: 

J* x 2 dv +sinydy + zdz = J* vQx 3 -cosy + ^z 2 ) • Tds 


c 

13 12 

^x -cosy + 2 z 


(0,2,3) 


: (- cos(2) + i3 2 ) - (i2 3 - 1 + o'). 

J (2,0,0) V ’ VJ ’ 

For the second integral we see there is no change in v along C\ or C 2 . There¬ 
fore 


Jc 3 

Again by the Fundamental Theorem 


J y dx= I ydx= I 2dx. 

c Jc 3 Jc 3 


f 2dx= f ' 

Jc 3 Jc 3 


V(2x) • Tds = 


2x 


(0,2,3) 


= -4. 


(2,2,1) 


Thus J F-Tds = -cos(2) + §- f + 1-4. □ 

In Example 7.21 we saw that F(x,y,z) = (x,y,z) is conservative by producing a 
potential function g. In Example 7.9 we saw, as a result of calculating some integrals, 
that F(x,y,z) = (y,-x,z) does not satisfy the independence of path property, so by 
Theorem 7.1 (y, -x,z) cannot be a conservative vector field. The next theorem gives 
us a useful criterion for a vector field in R 3 or R 2 to be conservative. 


Theorem 7.3 .(a) If F is a conservative C 1 vector field from R 3 to R 3 , then 

curlF = 0. 

(b) If ¥ is a conservative C 1 vector field from R 2 to R 2 , then curlF = 0. 


Proof We prove part (a). If F = Vg then F = C/ 1 ,/ 2 ,/ 3 ) = (g x ,g y ,gz) and 


curlF — (fy y fii z , f$x + f\ziflx fly) — (gzy gyz> (§zx gxz)>8yx gxy )• 
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Since F = Wg is continuously differentiable, g is twice continuously differentiable, 
so its second mixed partial derivatives do not depend on the order of differentiation. 
This shows that curlF = 0. □ 

Example 7.23. Let F(x,y,z) = (y,-x,z). Then 


curlF = (0 - 0, -(0 - 0), -1 - 1) = (0,0, -2). 


According to Theorem 7.3, since curlF ^ 0, F is not conservative. □ 

The next example shows that while curlF = 0 is a necessary condition, it is not 
sufficient to imply that F is conservative. 



Fig. 7.18 Sketches of the vector field in Example 7.24. F is not conservative. 


Example 7.24. Let F(v,y,z) = 


0 - , , , * , ,o), defined for (x,y) t (0,0). 
x z +y z x z +y z ) 


Then 


curlF = 


|o- 0 ,-( 0 - 0 ), 


2 2 2 2 \ 
T-xr y -Jr \ 

(.x 2 + y 2 ) 2 (x 2 + y 2 ) 2 / 


= 0 


Let Cr be the circle of radius R in the x,y plane, centered at the origin traversed 
counterclockwise. See Figure 7.18. Notice that at each point along the circle 
Cr, the vector F(x,y, 0) has magnitude R~ l and points in the same direction as 
the unit tangent T(x,y,0). Hence F T = R~ l and 


( F Td s = R ^engti^C/O = R l 2nR = 2n. 

Jc R 


Even though curlF = 0, F is not conservative since the integral along the 
closed curve Cr is not zero. □ 

Example 7.25. Figure 7.19 shows some level sets for a C 1 function / in R 2 , 
and some piecewise smooth curves C\ and C 2 in R 2 . 

If F = V/, find f F • Tds and f F • Tds. 

Jc 1 Jc 2 


f F-Tds 

JC! 



• Tds = /(Q) - /(P) = 30 - 40 = -10. 
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Fig. 7.19 Level sets and curves in Example 7.25. 

The line integral 

f F • Tds = 0, 

Jc 2 

because F is conservative and C 2 is a closed curve. □ 

Example 7.26. Show that F(v,y) = (x + y,y 2 + x) is conservative by finding a 
potential function g so that F = Vg. That is, 

gx = x+y, g y =y 2 + x. 

An antiderivative for g x with respect to v is 

g(x,;y) = \x 2 + xy + h(y ) 

where h(y) is some function of y. The partial derivative with respect to y is 
then 

g y = x + h'(y). 

Take h(y) = |y 3 so that g y =y 2 + x. We have found 

V^x 2 +xy+\y i ) = (. x+y,y 2 + x). 


Example 7.27. Is F(x, v) = (x+y,y 2 ) conservative? We compute 


□ 


curlF(x,y) = fix ~f\y = 0 - 1 + 0. 
By Theorem 7.3, F is not conservative. 


□ 


Problems 


7.20. Find a potential function for each of these conservative vector fields. 
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(a) (*,0,0) 

(b) «Uy) 

(c) a constant vector field ( a , b, c) 

(d) F + G, if F and G are each conservative. 


7.21. Let 


G(x,y) = ( 


x 2 +y 2 ’ x 2 +y 2 


), F(x,y) = ( 


-y 


x 2 +y 2 ’x 2 +y 2j 


(a) Find a potential function g(x,y) so that Vg = G. What is the domain of g ? 

(b) Find V tan -1 (-). Why is tan -1 (-) not a potential function for F? 

7.22. Show that the vector fields in the integrands below are conservative by finding 
potential functions, and evaluate the integrals on any smooth curve C from (0,0,0) 
to (a,b,c) using the Fundamental Theorem of Line Integrals. 


X 

X 


z 2 dz 


(b) J V(xy) • Tds 

(c) I zdx + ydy + xdz 


7.23. Let C be a smooth curve from (0,0,0) to ( a , b , c). Which of the integrals below 
can be evaluated by the Fundamental Theorem of Line Integrals? Find the values of 
those that can. 


(a) 

(b) 

(c) 


x~ civ 


X 

J r (V(xy) - 3 V(z 2 cos y )) • Tds 
d*+ dy 


X 


7.24. (a) Use the fact that 


to evaluate 


= vf^-) 

(x 2 + y 2 ) 2 ^ x 2 + y 2 ' 


X; 


- dx + ■ 


y 


■dy 


)c (x 2 +y 2 ) 2 (x 2 +y 2 ) 2 

where C is any smooth curve from (1,2) to (2,2) not passing through the origin, 
(b) Why do we restrict C to curves that do not pass through the origin? 

7.25. (a) Use the fact that V(||X|r 1 ) = -||X|r 3 X to evaluate 

-X\ dx\ - X2 dX2 - *3 d *3 


X 


c c x 2 +x 2 +x 2 yi 2 
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where C is any smooth curve from (1,1,2) to (2,2,1) not passing through the 
origin. 

(b) Why do we restrict C to curves that do not pass through the origin? 


7.26. Each of the vector fields below is some variant of the inverse square field 

iixir 3 x=v(-iixir 1 ). 


Find a potential function for each of them. 


(a) 

(b) 

(c) 

(d) 


2(-x, -y, -z) 


( x 2 +y 2 +z 2f/2 

3 

(x 2 +y 2 +z 2 ) 3 ^ 2 
(x,y-5,z) 


( x,y,z) 


(x 2 + (y-5) 2 +z 2 ) 3/2 

3(x+l,y-5,z) (x,y,z) 

((x+l) 2 + (y- 5) 2 + z 2 ) 3/2 + (x 2 +y 2 + z 2 ) 3 ' 2 


7.27. The inverse square vector field in Problem 7.26 finds uses in modeling gravity 
forces of point masses, and of electrostatic forces of point charges. Let Pi,..., P& be 
k different points of R 3 where the masses or charges are located. Let c\,...,Ck be 
numbers that represent the masses or charges, all of one sign in the case of gravity. 
Then the vector field 


k 

F(X)=yy 7 iix-p / r 3 (x-p 7 ) 

7=1 

can represent the force on another mass or charge due to the k given particles. Show 
that F is conservative by finding a function g(X) so that F = Vg. 



Fig. 7.20 The orbit of the moon around the earth, for Problem 7.28. 


7.28. The elliptical orbit of the moon around the earth can be expressed in the form 
discussed in Chapter 5, 
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y = -19 


^i 2 +3> 2 + 7.2x 10 9 . 


[meters] 


See Figure 7.20, where the ellipse is to scale, and almost circular. (Moon and Earth 
are drawn too large so you can see them better.) 


(a) Verify from this orbit equation that the Earth-Moon distance at apogee, that is 
farthest from Earth, at the bottom of the figure, is 400 x 10 6 meters, and at perigee, 
closest to Earth at top of figure, 360 x 10 6 meters. 

(b) Over a six month interval, the work done on the moon to pull it from apogee 
down to perigee is 

f F Td s- 

Jhalf orbit 


where the force is 


F = -mMGV 



m and M are Moon and Earth masses, and G the gravity constant. Use the Fun¬ 
damental Theorem 7.2 to show that the work is equal to 


mMG ( 


^/perigee ^apogee 


)• 


7.29. Let the conservative field 

F(x) = iixir 3 x = v(-iixir 1 ) 

represent the electric field due to a positive charge at the origin. 

(a) For each small h > 0 the field + ^ ^ ^)) —F(X) represent the electric 

h 

field due to a strong positive charge at (-h, 0,0) and a strong negative charge at 
the origin. Show that it is conservative. 

F(X + (/*,0,0))-F(X) . 

(b) Show that lim---is conservative. 

h —>0 h 

(c) Denote by g a function that is three times continuously differentiable, and set 
G = Vg. Show that each of the partial derivatives G x , G v , and G z is a conservative 
field. 


7.30. We have seen in Problem 7.27 that a finite number of point charges produce 
an electric field that is a linear combination of inverse square fields. Suppose there 
is a continuous distribution of charge density c(X) [charge/volume] at each point X 
in a bounded set D in R 3 . The potential at any point P outside of D is 

s<p> =Xiira d ^ 

(a) Suppose P is at least one unit distance from every point of D. Show that the 
integrand 
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c(X) 

IIX-PII 

is a bounded continuous function of X in D. 

(b) Integrate the Taylor approximation (See Problem 4.29) 

iix-pir 1 «iipir 1 +iipir 3 p-x+i||pir 5 (3(p-x) 2 -iipii 2 iixn 2 ) 

to get an approximation for the potential: 

g(P) * (£c(X)dyJ||P||- 1 + g|j^c(X)^dyJpjP||- 3 

(X c (X)3^,dy)p, w ||Pir 5 -^c(X)||X|| 2 dy||Pir 3 . 

(c) Show that the integrands c(X)xj, c(X)xkXj , and c(X)||X|| 2 are bounded continu¬ 
ous functions in D. 

(d) Show that as P tends to infinity the three terms of the approximation are bounded 
by ai||P|[ -1 , <Z 2 l|Pir 2 » tfsIlPir 3 , for various constants 

Remark. The gradients of the three terms of the approximation are known in Physics 
as the Coulomb, dipole, and quadrupole parts of the field. 

7.3 Surfaces and surface integrals 

We introduce smooth surfaces in R 3 , using parametrizations. 


Definition 7.7. Let X be a C 1 function from a smoothly bounded set D in R 2 to 
R 3 , denoted X(m, v) = (x(u 9 v) 9 y(u 9 v),z(m, v)). Suppose X satisfies the following 
conditions on the interior of D. 

(a) X is one to one. 

(b) The partial derivatives of the component functions of X are bounded. 

(c) The partial derivatives 

X„(m,v) = (x u (u 9 v),y u (u,v),z u (u 9 v)), 

X v (m,v) = (x v (u 9 v) 9 y v (u 9 v),z v (u,v)) 

are linearly independent so that X M (w, v) x X v (m, v) ^ 0. 

The range S of X is called a smooth surface , parametrized by X. The plane that 
contains the point X(m, v) and has normal vector X m (m, v) x X v (m, v) is called the 
plane tangent to S at X(w, v), and X is called a parametrization of S . 
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Fig. 7.21 The surface in Example 7.28. 

Example 7.28. Let 

X(u , v) = (3 cos v sin u , 3 sin v sin u , 3 cos i/) 

with 0 < u < 7i and 0 < v < 2 n. Since 

||X(w,v)|| 2 = 3 2 (cos 2 v + sin 2 v)sin 2 w + 3 2 cos 2 w = 3 2 , 

the points X(m, v) lie on a sphere of radius 3 centered at the origin. See Fig¬ 
ure 7.21. X is one to one on the interior of the domain, where 0 < u < n, 
0 <v <2n. The partial derivatives 

X m (m, v) = (3 cos v cos u, 3 sin v cos u,- 3 sin u) 

X v (u,v) = (-3sinvsinw,3cosvsinw,0) 

are linearly independent where sin u ± 0. We compute a normal vector to the 
tangent plane, 

X u (u,v)xX v (u,v) = 9sinw(cosvsinw,sinvsim/,cosw), 
and its norm 

\\X u (u,v) xX v (m, v)|| = 9 sin u. 


□ 

One way to parametrize the graph of a C 1 function f(x,y ) = z is to define a new 
function X(x,y) = (v,y,/(v,y)) from the x,y plane to R 3 . 

Example 7.29. Let f(x,y ) = ^R 2 -x 2 -y 2 , R > 0. Define 
X(x,y) = [x,y, ^ 2 -x 2 -y 2 ) 
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for x 2 +y 2 < a 2 < R 2 . See Figure 7.22. The partial derivatives 




1 , 0 ,- 


x/R 2 -x 2 -y 2 i 


Xy= 0,1, 


\JR 2 - a ' 2 -y 2 ; 


are linearly independent because the pattern of zeros and ones prevents either 
vector being a multiple of the other. Since 0 < x 2 +y 2 < a 2 < R 2 the derivatives 
are bounded: 



R R 

< < and 

y 

R 

< 

^R 2 -x 2 -y 2 

xjR 2 - x 2 - y 2 V R 2 - a 2 

y/R 2 -x 2 -y 2 

Vr 2 ^ 2 


The range S of X is the part of the upper hemisphere of radius R centered 
at the origin, that sits above the disk of radius a > 0 centered at the origin in 
the x,y plane. See Figure 7.22. A normal vector to the plane tangent to S at 
X(v,y) is 


X x (v,y)xX3;(x,y) = 


^/r 2 -x 2 -y 2 ' V^ 2 -" 2 -" 2 

R 


,1 


and its norm is ||X x (x,y) xX :v (%,);)|| = 




V/? 2 -x 2 -y 2 



Fig. 7.22 The surface S in Example 7.29. 

We call X(x,y) = ( x,y,f(x,y )) the (x,y) parametrization of the graph of /. Here 
X^(x,y) = (l,0,/ f (x,y)), X v (x,y) = (0, l,f y (x,y )) 
and a normal vector at the point ( x,y,f(x,y )) is 

X x (x,y)xX y (x,y) = (-f x (x,y),-f y (x,y), 1). 
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Area Now that we have a definition of a smooth surface, we define its area. 

Definition 7.8. Suppose S' is a smooth surface parametrized by X from a 
smoothly bounded set D in the u , v plane into R 3 . The area of S is defined 
as 

Area(S)= dcr = \\X u (u,v) xX v (u,v)\\ dudv. 

Js Jd 


Take the case where S is the graph of a function / on D , and S is parametrized 
by X(x,y) = (x,y,f(x,y)). Then ||X A .(x,y) x X v (x,y)|| = yj 1 + fy + f; and the surface 
area of S is 

Area(S) = + ft + fydxdy. 

We show that Definition 7.8 is a reasonable definition for the area of a smooth 
surface by looking at some estimates for the area. Take a point X(u , v) on S and 
three nearby points X(u+Au,v), X(u,v + Av), X(u + Au,v + Av). See Figure 7.23. 

By holding v fixed and letting u vary, X{u , v) parametrizes a curve in S. Since X 
is C 1 , the secant vector X(u + Au,v) - X(u,v) is well approximated by the tangent 
vector X u (u,v)Au. Similarly if we hold u fixed and vary v we get another curve in S 
with a secant vector X(u, v +Av) - X(u,v) well approximated by the tangent vector 
X v (u,v)Av. By Definition 7.7, X u (u,v) and X v (u,v) are linearly independent. The 
area of the parallelogram determined by the two tangent vectors is 

\\X u (u,v)AuxX v (u,v)Av\\. 

It is a good estimate for the area of the two triangles determined by the secant vectors 
in Figure 7.23. 



The sum of the areas of the secant triangles tends to the integral formula for the 
area of S , 
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JD 



Fig. 7.24 Approximating area by parallelograms. 

Example 7.30. Find the area of the sphere S of radius R > 0, using the 
parametrization 

X(u,v) = (Rcosvsinu,Rsinvsinu,Rcosu), (0 <u<n, 0 < v < 2n). 

We get 


X u (u,v) xX v (w,v) = R 2 sinw(cosvsinw,sinvsinzi,cosz/) 


and ||X m (m,v)xX v (m,v)|| = R 2 sinu. So 



= R 2 dv sinwdw = R 2 (2ti)2 = 4 nR 2 . 


'o Jo 


□ 


Remark. The parametrization of the surface of a sphere in Example 7.30 is 
inspired by spherical coordinates, where u and v play roles similar to latitude and 
longitude on a map. This suggests denoting the parameters by (6,(p) rather than 
0,v). 

Example 7.31. Find the area of the part of the upper hemisphere given by the 
(x,y) parametrization in Example 7.29, 




where 0 < a 2 < R 2 . In Example 7.29 we showed that 
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l|X x (x,y) x X y (x,y)|| = — - , 

V^ 2 -x 2 -j 2 

SO 

Area (5) = Id cr = — — dxdy. 

Js J D ^ R 2 _ x 2_ y 2 

The integral is easier to evaluate if we change variables to polar coordinates. 
The region D in polar coordinates is 0 < 6 < 2n, 0 < r < a. So 


Area (S') 


-L 


R 


■\jR 2 -x 2 -y 2 


dxdy 


=*rr 

Jo Jo 


rdrd# 

Vr 2 ^?- 


= 2nR^-(R 2 -r 2 ) 112 


r-a 

r =0 


2nR(R-(R 2 - a 2 ) 112 ). 


In Problem 7.34 we ask you to deduce from this formula a classical discovery 
by Archimedes. Note that as a tends to R , the area tends to 2nR 2 , the area of 
the upper hemisphere of radius R. □ 

Now that we have shown that the area formula works for spheres let’s look at 
some other simple examples where we may not know the area. 



Fig. 7.25 The surface in Example 7.32 is the part of the cone between z= 1 and z = 3. The u,v 
rectangle is for the parametrization in Example 7.33. 


Example 7.32. Let S be the part of the surface z 2 = x 2 +y 2 that lies between 
the planes z = 1 and z = 3. See Figure 7.25. Find the area of S. We need to find 
a parametrization of S , and the simplest approach is the (x,y) parametrization 
since S can be thought of as the graph of the function 

Z = f(x,y)= JE+y 2 , 

defined on the annulus D in the x, y plane where 1 < x 2 +y 2 <9. Let 
X(x,y) = (x,y,f(x,y)) = (x,y, JE+y 2 ) 
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on D. Then 

X x (x,y) = (1,0 ,Ux,y)) = (1,0 ,x(x 2 +y 2 Y m ) 
X y (x,y) = (0,1 ,f y (x,y)) = (0, l,j(x 2 + y 2 y 112 ) 
X x (x,y) X X y (x,y) = (- f x (x,y),-f y (x,y), 1) 

\\X x (x,y) x X,(jc,y)|| = J7 2 + / 2 + 1 

and we find 



] x 2 +y 2 x 2 +y 2 


1 = V2. 


Therefore 

Area (S') 


f dcr= [ \\X x (x,y)xX y (x,y)\\dA 
Js Jd 

- I" V2dxdy = V2Area(Z)) = V2(7 t 3 2 -7rl 2 ) = 87rV2. 

Jd 


□ 

Example 7.33. Another way to compute the area of S , the part of the surface 
z 2 = x 2 + y 2 in Example 7.32, is to use the parametrization 

X(u,v) = (wcosv, wsinv,w), 1 < w < 3, 0 < v < 27 t 


suggested by polar coordinates. D is the rectangle [1,3] x [0, 2n\. When u-k 
the points 

X(k,v) = (&cosv,/:sinv,/:) 

are on a circle of radius k in the z = k plane in R 3 centered at (0,0, k). As k 
increases, larger circles are mapped into higher planes. We calculate 


X m (m,v) = (cos v, sin v, 1) 

X v (u,v) = (-i/sinv,i/cosv,0) 

X u (u,v) xX v (u,v) = (-ucosv,-usinv,ucos 2 v + usin 2 v) 
\\X u (u,v)xX v (u,v)\\ = V2 u 


and 


Area (S)= f dcr = f V2 u 

Js Jd 

=a 


dwdv 


* 271 - r 3 32 _ ^ 2 

V2wdwdv = 27T V2—-— = 87T V2. 
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We have seen some examples where two different parametrizations of a surface 
are used to calculate the area for the surface. Here is another class of such examples. 
Take first the case where the surface S is part of a level set 

g(x,y,z) = c 

with Vg + 0. According to the Implicit Function Theorem, Theorem 3.13, one or 
more of the variables is a function of the other two, locally near any point of S. 
Suppose we have 

z = f(x,y), (x,y) in D. (7.2) 

Then f x = - —, f y = and the area of S is 
8z 8z 


yjfi+fy + idxdy. 

Example 7.34. Find the area of the part of the surface g(x,y, z) = x 2 +y 2 - z 2 = 0 
that lies over the set in the x,y plane where 1 < x 2 +y 2 < 9. (See Examples 7.32 
and 7.33.) By the Implicit Function Theorem for z = f(x,y). 


fx = 


—2x 

^2z 


fy ~ 


-2y 
-2 z 


So the area is 




x*+y2 


+ Idxdy 


= I Vl + 1 dxdy = V2 Area(Z)) = 87r V2. 
Jd 


Suppose that the surface S expressed in (7.2) can also be expressed as the graph 


of 


Then h x 


Sx 

Sy' 


,h z 


_8z 

8y m 


y = h(x,z ), (x,z) in E. 


We show now that 


J d -JEErf + idxdy = J e 4‘ 


h 2 + h 2 + l dxdz. 


There is a mapping F from E onto D , 

F (x,z) = (x,h(x,z)) = (x,y). 


See Figure 7.26. The Jacobian of F is 
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J¥(x,z) = det 1 ° 

“X h Z \ gy 


The change of variables formula, Theorem 6.12, gives 


dxdy = J" 


gl + gy+gz l ~g z 
Sz ' Sy 


— \dxdz = 

Sy 



hl + l+hz dxdz. 



z 


y 


Fig. 7.26 S is both the graph of z = f(x,y ) and the graph of y = h(x,z). 

More generally suppose we have two parametrizations, one Xi (m, v) from D\ onto 
S , and another X 2 (s, t ) from Z) 2 onto S , and a differentiable “change of parametriza- 
tion” G so that 

(s,t) = G(m, v) 

X 2 (j,0 = X 2 (G(ii,v)) = X 1 (ii,v) 

for all ( 5 ,0 in D 2 and (m, v) in £> 1 . See Figure 7.27. By the Chain Rule 


DX 2 (G(u,v))DG(u,v) = DX\(u,v) 


(7.3) 


In Problem 7.45 we guide you to a proof that (7.3) implies that 


X 2 ^(w, v) x X 2 f(w, v) detZ)G(w, v) = Xi M (w, v) x Xi v (m, v). 


(7.4) 


By (7.4) and the change of variables formula, 


AreaOS) = \\X 2 S (s,t)xX 2 t(s,t)\\dsdt 




||X 2j (G (m, v)) x X 2 ,(G(m, v))|| | detDG(w, v)| d u dv 
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The area of S is independent of the parametrization of S . 



y 


X 


u 


Fig. 7.27 S is parametrized by both Xi and X 2 . 


Surface integrals. Suppose at each point (x,y,z) on a surface S we know the charge 
density f(x,y,z) [charge/area]. What is the total charge distributed over S ? Given a 
parametrization of S , X(m, v) defined on D , we can approximate the charge on the 
patch of S that is the image of a Au by Av rectangle in D by multiplying the density 
at a point on the patch, f(X(u, v)), by the approximate area of the patch: 

/(X(m, v))\\X u (u, v)Au x X v (w, v)zfv|| = f(X(u, v))|| X u (u, v) x X v (w, v)|| \AuAv\. 

The total charge is the sum of the charge on the patches. This motivates the definition 
of the surface integral of / over S. 

Definition 7.9. Suppose S is a smooth surface parametrized by X from a 
smoothly bounded set D in the u , v plane to R 3 , and suppose / is a contin¬ 
uous function on S . The surface integral of f over S is defined as 



In particular the surface area of S is the integral of / = 1. 

Example 7.35. Let S be the surface x 2 +y 2 = z 2 , 1 < z < 3 of Examples 7.32, 
7.33, and 7.34. Suppose the charge density at each point (x,y,z) of S is 



Find the total charge on S . We parametrize S by 


X(w,v) = (wcosv, wsinv, u), (l <u<3, 0<v< 2n). 


The total charge is 
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f fdcr= f f(X(u,v))\\X u xX v \\dA = C fV“ 2 V2 M 
Js Jd Jo J i 

= 27tV2^ e~ u2 udu = In V2^-^e _w2 j = n V2(e _1 -e -9 ). 

The average charge density on S , using the area we found previously, is 


dudv 


/average : 


/ s /d<r_^V 2 (e-'-e-») = ,_ e _ 9) 


Is d<T 


8^ V2 


□ 

Properties of surface integrals. From properties of double integrals, surface inte¬ 
grals have these properties: If / and g are continuous functions and c is a constant, 
then 








( cfdcr - c I fdcr 

Js Js 

f (f + g)dcr= f fdcr+ f gdcr 
s a s a s 

If m < f < M then m Area (S) < J" 


fdcr < MArea(5). 


We also define the integral over piecewise smooth surfaces. If S is a union of 
finitely many smooth surfaces S that intersect only pairwise on common 
boundaries, such as the six sides of a cube that meet along their edges, we have 


f fdcr = f fdcr + • • • + f fdcr. 

JSiUS 2 'J“-'JS m JS 1 JS m 


Flux of a vector field across a surface. In Chapter 1 we defined the volumetric 
flow rate or flux of a constant velocity field U across the parallelogram determined 
by V and W in the direction of V x W as 


U-(VxW). 

V x W 

See Figure 7.28. Let N be the unit normal N = ———— and call the parallelogram 

||VxW|| 

S ; then the flux of U across S in the direction VxWis 


U • (V x W) = U • 


V x W 
IIVxWH 


||Vx W|| = U-NArea(S). 


Suppose now F is a vector field whose domain contains a smooth surface S , 
parametrized by X from the u , v plane to R 3 . By Definition 7.7 the vectors X u (u,v) 
and X v (m, v) are linearly independent and determine a plane tangent to S at X(u,v). 
The vector 

1 

\\X u (u,v) xX v (u,v)\\ 


N(X(m,v)) = 


■X„(m,v)xX v (m,v) 







7.3 Surfaces and surface integrals 


321 



Fig. 7.28 Flux of U across the parallelogram determined by V and W. 


is a unit normal to the tangent plane at X(u, v). See Figure 7.29. We call 

^ NT w X„(m,v)xX v (m,v) 

F(X(m, v)) • N = F(X(m, v)) • 77 —- , - 777 

I|X«(m,v)xX v (m,v)|| 

the normal component of F at X(w, v). Next we define the flux of F across S . 




Fig. 7.29 Left: A unit normal vector N at X(w, v) on S . Right: F(X(w, v)) • N(X(u, v)) is the normal 
component of F at X(w, v). 


Definition 7.10. Let S' be a smooth surface parametrized by X from a 
smoothly bounded set D in the u, v plane to R 3 . 

Let F(x,y,z) = (/i(x,y.z),/ 2 (x,y,z),/ 3 (x,y,z)) be a continuous function on 
S . The integral of the normal component of F, F • N, over S is called the flux 
of F across S in the direction of N. Using Definition 7.9 it is 

f F • N d<r = f F(X(M,v))-N(X(M,v))||X M (M,v)xX v (M,v)||dMdv 
Js Jd 


■X 


F(X(k, v)) • (X„(M, v) x X v (u, v)) dwdv. 
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Fig. 7.30 Positive flux of F across S at the left, and negative flux of G across S at center. For the 
flux of H at the right we can’t tell the sign of the flux at a glance. 


Example 7.36. Let S be the surface x 2 +y 2 = z 2 , 1 < z < 3. Find the flux of 
F(x,y,z) = (-y,x,z) across S, J" F -Ndcr, where N points away from the z 
axis. We know from Example 7.33 that 

X(u,v) = (w cos v,w sinv, u), 

is a parametrization of S and that 


1 <u<3, 0 < v < 27T 


X u (u,v) = (cos v, sin v, 1) 
X v (u,v) = (-wsinv, wcosv,0) 
X u (u, v) x X v (m, v) = (—u cos v, —u sin v, u) 
\\X u (u,v)xX v (u,v)\\ = ^2u 


The vector 


. T/ x X M (u,v)xX v (u,v) (-u cos v, -m sin v, u) , . 

N (u,v) = Mv - - — — - -— =-—-= -^= (— cos v, — sin v, 1) 


l|X I i(w,v)xX v (w,v)|| 


V 2 w 


is normal to S'. At w = 2, v = N(2, |) = -^=(0, -1,1) points toward the z axis. 

See Figure 7.31. To get the net flux of F across S in the direction away from 
the z axis we use the opposite unit normal 


N(w,v) = 


X v (u,v)xX u (u,v) _ 1 
||X v (m,v)xX b (m,v)|| “ V2 

27T /-*3 


(cosv, sinv,-1). 


v)||dwdv 


r* r*Z 7 T AO 

I F • Ndcr = I F(X(m, v)) • N(w, v)||X v (m, v) x X„(m, 

JS Jv=0 Ju=l 

r*27T 3 

= I I (-w sin v, w cosv, w) • -^= (cosv,sinv, -1) V2wdwdv 

J v =0 Jm=1 V2 


r»27r /^3 

= f f (-« 2 

Jv=0 Ju= 1 


) dw dv = 27 t 


1 „3 


= ±2ct(-26). 


□ 










7.3 Surfaces and surface integrals 


323 



-pi. — /j-g . .. . X u (u,v)xX v (u,v) 

Fig. 7.31 A unit normal ——--———-— 

||X„(k,v)xX v (k,v)|| 

the “inside” of the cone, toward the z axis. 


at the point (0,2,2) in Example 7.36 points toward 


Orientation of surfaces. An important thing to notice in the definition of the flux 
integral J F • N d<x is that the sign of the integrand depends on the order in which 

Js 

the cross product of X u (u, v) and X v (u, v) is taken. The resulting unit normals 

X u (u, v) x X v (w, v) X v (m, v) x X u (u, v) 

\\X u (u,v) xX v (u,v)\\ ’ \\X v (u,v)xX u (u,v)\\ 

can be used to distinguish the two sides of a surface. For example, a sphere S has a 
set of unit normals that point outward and a set that point inward. In Example 7.28 
we parametrized a sphere of radius three centered at the origin by 

X(u,v) = (3 cos v sin w, 3 sin v sin w, 3 cos z/), 0 < u < n, 0 < v < 27r. 

Which way do the unit normals 


X u (u,v)xX v (u,v) 

||X m (w,v)xX v (k,v)|| 

point? We found that X u (u,v)xX v (u,v) = 9sinw(cosvsinw,sinvsinw,cosw). Where 
si nu is positive, this is a positive multiple of X(u, v), a point on the sphere, so these 
normals point outward. 

When we choose a side for the normals we say the surface is oriented. If a surface 
S is a union of n oriented surfaces S\,...,S n , then S is orientable if we can choose 
unit normals so that if edges between the surfaces were smoothed, we would be able 
to extend the normals continuously across the smoothed edges. Let’s look at another 
example. 

Example 7.37. Let squares S \ and S '2 be parametrized by 
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Fig. 7.32 The surfaces in Example 7.37. The left and center surfaces are oriented. The surface on 
the right is not oriented. 


Xi(x,t) = (v,y,4) 0<v<l, 2<y<3, 

X 2 (x,z) = (x,3,z) 0<x<1, 4<z<5. 

See Figure 7.32. The unit normals to Si are (0,0,1) or (0,0,-1), and unit 
normals to S 2 are (0,1,0) or (0,-l,0). For S = S\ US 2 to be an oriented 
surface we need to choose unit normals that are consistent, so 

(0,0,-1) on Si, (0,1,0) onS 2 

as at the center in the figure, or 


(0,0,1) on Si, (0,—1,0) onS 2 

as at the left in the figure. □ 

At the right in Figure 7.32 is a surface that has two sides and is orientable, but 
this way of assigning unit normals does not result in an oriented surface. 

Not all surfaces are orientable or have two sides. Take a strip of paper, give it a 
half twist and tape the two short ends together to make a Mobius band. The strip 
before you joined the ends was an orientable surface. It had two sides. But after you 
put the twist in and joined the ends any normal you try to move around the surface 
comes back the opposite way after one loop. 

The notion of the flux of a vector field across a surface only makes sense for 
orientable surfaces; therefore we only work with orientable surfaces. 



Fig. 7.33 The surface S of the box in Example 7.38. 
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Example 7.38. Find the flux of F(v,y,z) = ( x 2 ,y + 1 ,z~y) outward across S, 
the surface of the box [0,2] x [0,3] x [0,4] shown in Figure 7.33. 


f F • N d<r = f 

Js Js 1 


F-Ndcr 


US2US3US4US5US6 


= f F• Ndcr + f F• Ndcr + f F-Ndcr+ f F-Ndcr+ f F-Ndcr + f F-Ndcr. 

Jsi 3s 2 3s^ Js 4 Js$ 3s 6 

Parametrize S\ by Xi(y,z) = (2,y,z). The outward unit normal vectors are 
(1,0,0) and 

f F • N dcr = f f 

Js i Jz =0 Jy= 

Similarly 

f F • N dcr = f f 

Js2 3z-0 3y-0 

f F• Ndcr = f 4 f (x 2 ,3 + l,z-3)-(0,l,0)dxdz = 4(8) = 32, 

Js 3 Jz=0 J.v=0 

f F• Ndcr = f f (x 2 ,1,2-0)• (0,-l,0)dxdz = -1(8) = -8, 

Js 4 3z-o3x~o 

^ F• Ndcr = I" r (x^y+hd-yFdfO,l)dxdy = 2^-^(4-y) 2 J 0 - 15, 

Js 5 Jy-OJx-O 

f F• Ndcr = f f (x 2 ,y+l,0-y)-(0,0,-l)dxdy = 2fiy 2 l 3 =9. 

Js 5 Jy =0 3x -0 L Jo 


F • Ndcr = I I (2 z ,y + l, z -y) • (l,0,0)dydz = 4(12) = 48. 

z=0 Jy=0 


(0,y+ 1 ,z-y) • (-l,0,0)dvdz = 0, 


Js 5 

Therefore 


I f 


Ndcr = 48+ 0 + 32-8+ 15+ 9 = 96. 


Alternative ways to find the flux of F across S'. If S' is a graph of a C 1 function 
z = f(x,y) over D , we can also think of S as the level set of the C 1 function g from 

R 3 to R, 

g(x,y,z) = z-f(x,y) = 0. 

Since the gradient of g is normal at each point of S , 

Vg(x,y,z) = (-fx(x,y),-f y (x,y), 1), 
the two possibilities for a unit normal to S' at a point (x,y,/(x,y)) are 

(- fx(x,y),-f y (x,y),i ) 


N = +- 


^Jfx+fy 


+ 1 


Note that 
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N • (0,0,1) = + 

is plus or minus the cosine of the angle between the tangent plane to S at ( x,y , f(x,y )) 
and the x,y plane. See Figure 7.34. That gives us a way to estimate the area of the 
part of S that sits above a small part of the domain D in the x,y plane. It is the area in 
the x,y plane times the reciprocal of the cosine of the angle between the two planes, 

dcr ~ -\jfx + fy + 1 dA. So the upward flux of F across S is 

f F • N d<x = f F(x,yJ(x,y))-(-Mx,y),-f y (x,y)J)dxdy 
Js Jd 

where D is the domain of / and S is the graph of / over D. 



Fig. 7.34 The angle between the tangent plane and the x,y plane. N is drawn upward. 


Example 7.39. Let the surface S be the part of the graph of f(x,y) = x 2 +y 2 
above the rectangle R = [0,1] x [0,3], with unit normal vectors N having a 
negative z component. The flux of F(v,y,z) = (-x, 3 ,z) across S is 


f F • Ndcr = f F(x,y,/(x,y)) • (f x (x,y),f y (x,y),-l)dxdy 
Js Jd 


= f f (~x,3,x 2 +y 2 ) • (2x,2y,-l)dxdy = f f (~3x 2 -y 2 + 6y)dxdy = 15. 
Jy-0 Jx-0 Jy-0 Jx-0 

□ 

Evaluating J* F • N dcr geometrically Sometimes we can evaluate a flux integral 

without parametrizing the surface, by making use of special geometric relationships 
between the vector field F and the surface S. 

X 

Example 7.40. Let F(X) = — j]x[p ^ nverse s 4 uare vector held. Find the 

flux of F outward across a sphere Sr of radius R centered at the origin. The 
direction of F(X) is in the opposite direction of X, and ||F(X)|| = ||X|| -2 = R~ 2 . 
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The outward unit normal vector N(X) = 
of F(X). Therefore 


IIXII 


points in the opposite direction 


F(X) -N(X) = ||F(X)|| ||N(X)||cos# = ||F(X)||(1)(— 1) = -R~ 2 . 


The flux of F outward across Sr is 

f F • N dcr = f -3- dcr = -3- Area (5 r) = --^r4nR 2 = -4n. 

Js R Js R R 2 R 2 R 2 

□ 

If in Example 7.40 we had computed the flux of the vector field into the sphere 
we would have used the inward pointing unit normals and the cosine of the angle 
between F and N would have been 1 instead of -1. The inward flux of F across S is 
4n. 


Example 7.41. Suppose at each point of a surface S , F(X) is tangent to S. 
Find the flux of F across S. Since F(X) is tangent to S , F(X) • N(X) = 0 at 
each point. So 


Is 


F • N dcr = 0. 


Example 7.42. Find the flux of F(v,y,z) = (x,y + 3x 2 ,z) outward across a 
sphere S of radius R centered at the origin. The outward pointing unit nor¬ 
mal vectors are 

N = / 7 ? 7 = 
yjx 2 +y 2 +z 2 K 

at every point (x,y,z) of the sphere. The flux is then 

f F-Ndcr= T (x,y + 3x 2 ,z)-(-^(x,y,z))do- 

Js Js R 




(x 2 + y 2 + 3x 2 y + z 2 ) dcr 


On S, x 2 + y z + z 2 = R 2 , so the first integral is 


-six 


(x 2 +y 2 +z 2 )dcr + 3 I (x 2 y)dcr . 






! X 


+ y +z )dcr = R dcr = R z Area (S') = 4nR . 


The second integral I (v 2 y)dcr is zero due to symmetry. (See Problem 7.37.) 

Js 

Therefore the flux is R~ 1 {4tiR a ) = 4 tiRJ . □ 
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Problems 

7.31. Set up integrals for the areas of the graphs of z = x 2 +y 2 and z = x 2 -y 2 over 
a smoothly bounded set D in R 2 . Show that for all D , the two graphs have the same 
area. 

7.32. Let S \ be the part of the plane 

j x + + jZ — 10, 

that lies over the rectangle 0 < v < 1 and 0 < y < 1 in the x,y plane. Let S '2 be the 
part of the same plane that lies over the rectangle 0 < y < 1 and 0 < z < 1 in the y, z 
plane. Show that Area(S 1 ) = \ and Area(S 2 ) = 

7.33. Suppose a 2 + b 2 + c 2 = 1, and let S be the part of the plane ax + by + cz = d that 
lies over a rectangle D in the x,y plane. Show that 

\c\ Area(S) = Area(D). 

7.34. Let R be the radius of a sphere. It is part of a classical discovery by Archimedes 
that the area of a spherical cap of height h, 2nRh, is the same as the area of a cylinder 
of the same height h and radius R. 

(a) Verify that the expression (R- (R 2 -a 2 ) 112 ), found in Example 7.31, is the height 
of the spherical cap there, thus proving Archimedes’ result. 

(b) Without evaluating any further surface integrals, deduce another discovery by 
Archimedes: that the area of every section of height h of a sphere has the same 
area. See Figure 7.35. 



Fig. 7.35 All sections of equal height through a sphere have the same surface area, in Prob¬ 
lem 7.34. 


7.35. Let X(m, v) = ( V2 mv, u 2 , v 2 ), (1 < u < 2, 1 < v < 2). 

(a) Calculate X M , X v and X u x X v . 

(b) Show that X parametrizes a surface S in R 3 , and at each point (x,y,z) of S , 
x 2 = 2 yz. 

(c) Find the area of S . 
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£ 


(d) Calculate the integral | ydcr. 

7.36. Let S be the surface of the cylinder in R 3 where x 2 +y 2 = r 2 and 0 <z<h, 
where r and h are positive constants. 


(a) Verify that a parametrization of S is 

X(w,v) = (rcosw,rsinw,v), 


0 <u< 2n, 0 < v < h. 


(b) Use X(w,v) to show that the area of S is 2nrh. 

(c) Evaluate I ydcr. 


£ 


(d) Evaluate J y 2 dcr. 

7.37. Let S' be the unit sphere in R 3 centered at the origin. S has a symmetry , that 
for each point (x,y,z) on S there is another point (-x, -y, -z) also on S. Show that 


I 


x 2 yd<x = 0. 


7.38. Let S be the unit sphere centered at the origin in R 3 . Evaluate the following 
items, using as little calculation as possible. 


(a) 

(b) 

(c) 


l 
l 

Verify that I x 2 dcr = I x\ dcr = I x 2 dcr using either a symmetry argument 

S S s 


l|X|| 2 dcr 


or 


Js Js 

parametrization without evaluating the integrals. 

(d) Use the result of parts (b) and (c) to deduce the value of 

7.39. Let S be the unit sphere centered at the origin in R 3 . 
(a) Show that x^ + x^ + x^ + 2(x 2 x 2 + x 2 x 2 + x 2 x 2 ) = 1 on S . 


J r x 2 dcr. 


(b) Evaluate J x 3 dcr using spherical coordinates. 

(c) Evaluate J" x 2 x 2 dcr using items (a), (b), and a symmetry argument. 

7.40. Let S be the flat parallelogram surface determined by vectors V and W, and 

y x W 

let N = 7 ^—. Verify that the flux of a constant vector field F across S is 


IIVxWH 


x 


F-Ndcr = F-NArea(S). 
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Fig. 7.36 Surfaces in Problems 7.41 and 7.42. 


7.41. Find the flux of the constant vector field F = (2,3,4) through each of the four 
oriented surfaces in Figure 7.36. 

7.42. Find the flux of the vector field F = (2y, 3z, 4x) through surface (a) in Fig¬ 
ure 7.36. 

7.43. Find the flux of the vector field F = (2y, 3 z, 4v) across the parallelogram deter¬ 
mined by the vectors V = (1,1,1) and W = (0,0,2) in the direction of V x W. 



Fig. 7.37 The oriented surface in Problem 7.44. 


7.44. Let S be the oriented surface shown in Figure 7.37. Determine without calcu¬ 
lation which of the following fluxes are positive. 

(a) J (0,1,0)-Ndcr 

(b) J (0,3y,0)-Nd<x 

(c) J (l,3y,0)-Nd<x 
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Js 

7.45. Suppose A,B,C,D are vectors of R 3 written as columns, and that 



3x2 


= [C D]. 


3x2 


Justify the following statements to prove that 


(ad - be) A xB = CxD. 


(7.5) 


Remark. We used this formula in the “change of parametrization.” 
(a) If we adjoin any third column Y to create square matrices, then 


a b 0 

[A BY] cd 0 = [C D Y]. 


[001 


3x3 


(b) Then Y • (A x B )(ad - be) = Y • (C x D). 

(c) Conclude (7.5). 

7.46. As we will see in the next chapter, the flux of fluid mass across a surface S is 



where p is the fluid density and V the velocity. Evaluate this integral for two cases: 

(a) S is a square of area A parallel to the x,y plane with unit normal vector 
N = (0,0,1), p is constant, and V = (a,b,c) is also constant. 

(b) S is a square of area A with a unit normal vector N, p is constant, and V is a 
constant k times N. 

7.47. As we will see in the next chapter, the flux of fluid momentum across a surface 
S is vector valued (integrate componentwise): 



where p is the fluid density and V the velocity. Evaluate this integral for two cases: 

(a) S is a square of area A parallel to the x,y plane with unit normal vector 
N = (0,0,1), p is constant, and V = (a,b,c) is also constant. 

(b) S is a square of area A with a unit normal vector N, p is constant, and V is a 
constant k times N. 


7.48. Let S be the part of the plane 
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3x- 2y + z = 10 


over the square D = [0, 1] x [0, 1] in the x,y plane, and take the normal vectors N 
upward, that is with positive z component. 

(a) Express the plane as the graph of a function of x and y, and find N. 

(b) Find the area of S and verify that the area of S is not less than the area of D. 

(c) Let F be a constant vector field (<a , b , c). Find the flux of F across S . 

7.49. Let S be the sphere of radius R > 0 centered at the origin of R 3 . Consider the 



. Show that the outward flux 



thus the flux does not depend on the radius of S . See Figure 7.38. 

7.50. Let S be the sphere of radius R > 0 centered at the origin of R 3 . Consider a 
vector field of the form 


F(X) = *(||X||)X, 



does not depend on the radius of S . Prove that k(||X||) is some multiple of ||X|| 3 . 
See Figure 7.38. 



Fig. 7.38 Flux of inverse square field through sphere, in Problems 7.49 and 7.50. 

7.51. Suppose surface S is parametrized by X(m,v), (m,v) in some smoothly 

bounded set D , and k is a positive number. Let T be the set of points 


Y (m, v) = kX(u, v), (m, v) in D. 


Show that T is a smooth surface parametrized by Y. Show that 

Area(r) = k 2 Area (S). 



Chapter 8 

Divergence and Stokes’ Theorems and 
conservation laws 


Abstract Green’s and Stokes’ Theorems are extensions to functions of several vari¬ 
ables of the relation between differentiation and integration. 


8.1 Green’s Theorem and the Divergence Theorem in R 2 

The Fundamental Theorem of single variable calculus asserts that differentiation 
and integration are inverse operations. More precisely, if / is a function that has 
continuous first derivative /' then 





( 8 . 1 ) 


The analogue of this result for C 1 functions of two variables is called the Divergence 
Theorem. As we will see it is obtained by applying the single variable result in each 
variable and integrating with respect to the other variable. 

Take first the case that D is a smoothly bounded set in the plane that is both x 
simple and y simple. Denote by dD the boundary of D. Let c and d be the smallest 
and largest numbers y for which there is a point (x,y) contained in D. Since D is 
x simple, for every value yo between c and d , the set of points (x,yo) in D is a 
single horizontal interval [a(yo),b(yo)] and both a(y ) and b(y) are continuous on 
[c,d]. Denote by a and b the smallest and largest numbers x for which there is a 
point (x,y) in D. Since D is y simple, for every value xo between a and b the set of 
points (xo,y) in D is a single vertical interval [c(xo), d(xo)] and both c(x) and d(x) 
are continuous on [a,b]. See Figure 8.1. 

Figure 8.1 shows a set that is both x simple and y simple and Figure 8.2 shows 
two that are not. Let / be a function that has continuous first partial derivatives f x 
and f y in D. Fix y and apply the Fundamental Theorem of Calculus in one variable. 
We get 
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Fig. 8.1 Left: D is x simple with boundary functions x = a(y ) and x = b(y). Right: D is y simple 
with boundary functions y = c(x) and y = d(x). 



Fig. 8.2 Left: D is x simple but not y simple. Right: C is y simple but not x simple. 


rhiy) 

| fx(x,y)dx = f(b(y),y)-f(a(y),y), 

Ja(y) 

where y lies between c and d. 

Integrating this relation with respect to y from c to d we get 

n x=b(y) pd 

fx(x,y) dxdy= I (f(b(y),y)-f(a(y),yj\dy. 

=a(y) Jc 


The iterated integral on the left is the integral of f x over D , I f x dxdy. Next we 

Jd 

observe that the integral on the right side is equal to the line integral 



fdy 


taken counterclockwise along the boundary of D, using parametrizations X\(y) = 
(b(y),y) on the graph of b(y) with y running from c to d , and X 2 (y) = (a(y) 9 y) on the 
graph of a(y ) with y from d down to c. Therefore the integral formula can be written 
as 

f fxdxdy = f fdy. (8.2) 

JD JdD 

Let g be another continuously differentiable function on D. We derive an analogous 
formula for the integral of g y over D\ 


r>b nd(x) 
Ja Jc{x) 


nb 

gy(x,y) dydx= I (g(x,d(x))-g(x,c(x)j) dx. 
Ja 
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The left side is I g y dxdy and the right side is equal to a line integral of g taken 

JD 

over dD in the clockwise direction. Taking it in the counterclockwise direction we 
get 

f gy dxdy = - f gdx. (8.3) 

JD JdD 

Adding the two formulas (8.2) and (8.3) we get 


f (fx + gy)dxdy = f fdy-gdx. 
JD JdD 


(8.4) 


We restate this result in vector language. Let X(s) = (x(s),y(s)) be the arc length 
parametrization of the boundary of D increasing in the counterclockwise direction. 

s is between 0 and the length of dD. Since the tangent vector (t~, ^-) has length 1 

ds ds 

at each point of dD , N = (^,-" 7 “)isthe outward pointing unit normal. Recall in 
ds ds 

Section 3.5 we defined divF = f x + g y , where F = (/,g). Then we can write (8.4) as 

F-Nds. (8.5) 

JdD 

This is called the Divergence Theorem. 


f divFdvdy = ( 

JD Jdl 



Fig. 8.3 In Example 8.1 the unit normals and F are parallel on the boundary of the disk. 


Example 8.1. We verify the Divergence Theorem in the case F(x,y) = (x,y) 
and D is a disk of radius R centered at the origin. At each point on the bound¬ 
ary, the outward pointing unit normal to dD is parallel to F. See Figure 8.3. 
Therefore at each point on dD , F • N = ||F|| = R. The line integral is 



F-Nds 



l|F||d s = R f 
JdD 


ds = R(2nR) = 2nR 2 . 


Since divF = — (x) + — (y) = 2, the integral of divF over D is 
dx dy 
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f divF dA = 2 f 
Jd Jd 


d A = 2 7iR\ 


□ 

Example 8.2. Let F(v,y) = (x,y) and let D be the rectangular region where 
-3 <x< 5 and -1 <y <2. 

Find the flux of F outward across dD. See Figure 8.4. Since divF = 2, by the 
Divergence Theorem 



F-Nds 


X 


divF d A = 


X 


2dA = 2 Area (D) = 2(8)(9) = 144. 


□ 



Fig. 8.4 The rectangle in Example 8.2 with outward pointing unit normals. 


Example 8.3. We can also compute I F • Nds of Example 8.2 by computing 

JdD 

four line integrals. 

f F-Nds = f F-Ndn f F-Ndn f F-Nds+ f F-Nds. 

JdD Jc i Jc 2 Jc 3 Jc 4 


We get 
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X 


JC 3 


JC 2 


Jc 4 


F-Nd 5 = 

I' (x,-7)-(0,-l)dx= J 1 

F-Nd j = 

^ (x,2)-(0, l)dx = 16, 

F-Nds = 

^(5,y)-(l,0)dy=J^5dy 

F-Nds = 

f (- 3,y)-(-l,0)dy = 27. 


Therefore 


f F 


Nds = 56 + 16 +45+ 27 = 144. This agrees with the calcu¬ 


lation in Example 8.2. □ 

Example 8.4. Let F(x,y) = (2x+ y 2 ,y + cosx), then divF = 3. The flux of F 
outward across the boundary of a smoothly bounded set D with area 10 that 
is both x simple and y simple is 

f F • N ds = f divF dA = 3 f dA = 3(10) = 30. 

JdD JD JD 


□ 

We next extend the Divergence Theorem to sets that are finite unions of smoothly 
bounded sets that are both x simple and y simple and have boundary arcs in common. 

Let D be a smoothly bounded set in the x,y plane. Divide D into two parts D\ and 
£>2 by a piecewise smooth curve C in D that connects two points of the boundary of 
D. See Ligure 8.5. 



Fig. 8.5 Sets D\ and D 2 with outward pointing normals. Curves B\, B 2 , and C are shown thin, 
thick, and dotted, respectively. 


We show now: 

If a C 1 vector field F satisfies 


j divF dA = 

JDi 


f F-Nd 5- 

JdD x 


and 


j divF dA = 

Jd 2 


f F-Nd j 

JdD 2 


( 8 . 6 ) 


over the two sets D\ and D 2 then 
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j divF d A = I F 
JD JdD 


Nds 


over the union D = D\ U £> 2 . 

To prove this, add the two equations in (8.6). The sum of the left sides is the 
integral of divF over £). We claim that the sum of the right sides is the integral of 
F • N over the boundary of D. To see this, note in Figure 8.5 that the endpoints of the 
curve C divide the boundary of D into two parts B\ and B 2 . The boundary of D\ is 
the union of B\ and C, and the boundary of £>2 is the union of B 2 and C. 

We note that the normal to the connecting curve C that is outward with respect to 
D\ is the negative of the normal to C that is outward with respect to D 2 . Therefore 
in the sum of the right sides of (8.6) the integrals of F • N over C as the boundary 
of D\ and as boundary of £>2 cancel! The sum of the remaining integrals over the 
boundary of D\ and £>2 is the integral of F • N over the boundary of £>. 

We can extend the Divergence Theorem to smoothly bounded sets that are the 
union of sets that are both v simple and y simple. 

Definition 8.1. We call a smoothly bounded set D regular if it is the union of 
a finite number of smoothly bounded subsets, each of which is x simple and y 
simple and any two have only a boundary arc in common. 


We have proved the Divergence Theorem for a vector field that is C 1 on a 
smoothly bounded set that is v simple and y simple. We have also shown that if 
a set D is a union of two subsets for which the theorem holds, then it holds for £>. 
Using this proposition repeatedly we conclude the following theorem. 

Theorem 8.1. The Divergence Theorem. If ¥ from R 2 to R 2 is C 1 on a reg¬ 
ular set D then 

f divF dA = f F-Nds. 

JD JdD 


All sets of practical or theoretical importance are regular; therefore we confine 
our studies to regular sets. 
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Example 8.5. For the function F(v,y) = (x,0), divF(v,y) = 1. Let D be a reg¬ 
ular set in R 2 . By the Divergence Theorem 

Area(D) = J lcL4= j divFdA = J xdy. 

Jd Jd JdD 

This shows that the area of D can be computed using only measurements taken 
on its boundary. See Figure 8.6. □ 



Fig. 8.7 The set D in Example 8.6. 


Example 8.6. Let F(v,y) = (x 9 -y). Find the outward flux of F across the 
boundary of D shown in Figure 8.7. We compute 

divF = F" W + 7T ( -y) =1-1=0. 

ox dy 

By the Divergence Theorem the outward flux of F across dD is 



F-Nds 


X 


divF dA = 



= 0 . 


□ 

Next we show how to use the Divergence Theorem to find the integral of the 
tangential component of F along dD , 



F-Tds. 


Suppose F is C 1 on a regular set D and dD is traversed in a counterclockwise direc¬ 
tion. If the outward unit normal vector at a point on dD is N = (ji\,ri 2 ), then the unit 
tangent vector there is T = (/ 1 A 2 ) = (-n 2 ,n\). See Figure 8.8. 

So for F = 



F-Tds 



(fti+gt 2 )ds = 


| (gni-fn 2 )ds. 
JdD 


By the Divergence Theorem and the definition of curl from Section 3.5 we have 
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Fig. 8.8 The unit normal and tangent vectors. 


/ ( 


(gn\ + (-f)n 2 )ds 


■{,( 


dg_ 

dx 



curlF d A. 


So we have proved Green’s Theorem. 


Theorem 8.2. Green’s Theorem If F = (f,g) is continuously differentiable 
on a regular set D in R 2 then 

f curlFdA = f F-Tds, 

JD JdD 

where the boundary of D is traversed in the counterclockwise direction. 



Fig. 8.9 In Example 8.7 the unit tangent and F(x,y) point in the same direction at each (x,y) on 
the circle. 


Example 8.7. Find the work done in moving a particle in the counterclockwise 
direction along a circle of radius R centered at the origin in the presence of a 
force field F(x,y) = (-y,x). Green’s Theorem can be applied as follows. The 
work is 



F-Tds 


fM (x) ~i’ ( ~ y) j dA= If + ^ ^ =2Area(D)=2nR2 - 


Alternatively, without using Green’s Theorem note that T and F have the same 
direction at each point on dD. Hence F • T = ||F|| = ^x 2 +y 2 - R and 
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F Td j 



Rds = /^(Length (dD)) = R(2nR) = 2nR 2 . 


Example 8.8. Let F(x,_y) = (-= - x, —t- x 

y x z +y z x z + y z 

centered at the origin. Find 


) and let D be the disk of radius R 


f 1 


F-Tds, 


where we traverse dD in the counterclockwise direction. We cannot use 
Green’s Theorem to evaluate this integral because the domain of F does not 
contain (0,0), hence F is not differentiable (or even defined) on all of D. Let’s 

compute I F • T ds and I ( g x - f y ) d A and compare the results. 

JdD JD 

F and T are in the same direction, and we see F • T = ||F|| = Therefore, 


JdD 

On the other hand 


f F Td ^= f l 

JdD JdD K 


ds = — Length (dD) = — 2nR - 2n. 
R R R 


- —( 
gX ~ /}v( 


l)= 


(x 1 +y 1 )-x(2x) 


dx ' x 2 + v 2 (x 2 + y 1 ) 2 (x 2 + y 2 ) 2 

(x 2 +/)(-!) -(-y)(2y) 


dy x x 2 +y 2 ' 


2 2 
y^-x* 


(x 2 +y 2 ) 2 


(x 2 +y 2 ) 2 ' 


Therefore, I ( g x -f y )dA= | 0 dA = 0. The two integrals are not equal. □ 
Jd Jd 



Fig. 8.10 Left: the loop C in Example 8.9 encircles the origin. Center: the boundary of D is CU Cr. 
Right: D comprises two subregions D\ and D 2 . 


Example 8.9. Find [ F • Tds, where F(v,y) = (—--, - -) and C is the 

Jc y x 2 +y z x z +y 2/ 

loop shown in Figure 8.10 traversed in the counterclockwise direction. F is 

not defined at (0,0) but F is defined on the regular region D between C and 
Cr, a circle of radius R centered at the origin that does not intersect C. The 
boundary of D is C U Cr. We know from Example 8.8 that 

dx ' x 2 + y 2 ' dy ^ x 2 + y 1 ' 
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By Green’s Theorem 



F-Tds 


X 


0 dA = 0. 


Splitting D into two regions D\, as in Figure 8.10 and using that Cr is 
oriented clockwise we get 


1= f F Td f F-Tds + f 1 

JdD JdDi JdD 2 


F-Tds 


In Example 
clockwise it 


= f F-Td s+ r 

Jc Jc 


F-Td j 

clockwise 

8.8 we saw that the integral along Cr counterclockwise is 2n, so 


is —2n. Therefore 


X F 


T ds = 2 n. 


Problems 


8.1. Let R be the rectangular region where \x\ < 4 and |y| <2 ,U the closed unit disk 
centered at 0, and S the disk where x 2 +y 2 <25. We define two more sets in R 2 as 
follows. Let D\ be the closure of the set obtained by removing U from R. Let D 2 be 
the closure of the set obtained by removing R from S . Sketch D\ and D 2 and show 
that they are regular sets. 


8.2. Show that for a regular set D with boundary oriented counterclockwise, 



J ydx = Area(D). 

dD 


8.3. Use the Divergence Theorem to evaluate the outward flux 



,y + 6x 2 )-Nds 


where 

(a) C is the unit circle x 2 +y 2 = 1 , 

(b) C is the boundary of the rectangle [a,b\ x [ c,d ]. 

8.4. Use the Divergence Theorem to evaluate the following integrals where D is a 
regular set and the unit normals N are outward. 

(a) f V(x 2 -/)-Nd^ 

JdD 

(b) I V/ • N ds , where / is a C 2 function that satisfies f xx + f yy = 0 on D. 

JdD 
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8.5. Let D be the right half of the unit disk, 
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x 2 + y 2 < 1, x > 0, 

and let F(x,y) = (1 + x 2 y 2 ,0). Verify the Divergence Theorem by evaluating both 
sides of 

f F-Nds= ^ divFdA 
JdD JD 

where the unit normals N are outward. 

8.6. Let F(x,y) = (-y, x), and let D be the quarter disc with outward unit normals N, 

x 2 + y 2 < 1, 0 < x, 0 < y, 

and denote by R the set drawn in Figure 8.11. 

(a) Compute the integrals 

f divFdA, f F-Nd^ 

JD JdD 

without using the Divergence Theorem. 

(b) Find the flux of F outward across dR. 



Fig. 8.11 The regions D and R in Problem 8.6. 


8.7. Let D be a convex polygonal region in M 2 with vertices Po,Pi,...,P n . Let N, 
be the outward unit normal on the i -th edge, D;, from P ; _i to P; ( D\ is V n to Pi). 
Justify the following items to prove 

n 

L llPf —Pf—lllNj = 0. 

i= 1 

See Figure 8.12 for the case n- 3. 

(a) Let Fi(x,y) = (1,0) and let the unit normals N = (ft 1 ,^ 2 ) be outward. Apply the 
Divergence Theorem to show that 
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(b) Let F 2 (x,y) = (0,1) to show that 


f f 


ri2 cL = 0. 


(c) The integral of a vector valued function is computed componentwise. Conclude 
that 


I Nds = ( | n\ds, J ft 2 ds] = 0. 
JdD ' JdD JdD ' 


(d) Show that 


r N/d^HP/- 

JDi 


P«-il|Ni. 


(e) Show that ^ ||P; -P,_i ||N ( - = 0. 


i= 1 



Fig. 8.12 a, b, and c are the edge lengths, and aN a + + cN c = 0 in Problem 8.7. 


8.8. Consider a rectangular region with edges parallel to the x and y axes. 

(a) Make a sketch of such a region showing the vector field F(x,y) = (0,cosv) at 
various points on the boundary. Explain why the net flux across the boundary is 
zero. 

(b) Make another sketch to show that the net flux of the vector field G(v,y) = (y 2 ,0) 
across the boundary is zero. 

(c) Show that the net flux of the vector field H(x,y) = (y 2 ,cosx) across the boundary 
of the region is zero. 


8.9. Suppose C is a circle in the plane that does not go through (0,0). Show that the 
circulation of 


F (x,y) = 


x 2 +y 2 ’ x 2 + y 2 


along C in the counterclockwise direction is either 0 or 2 n. 


8.10. Suppose F is a C 1 vector field on an annulus D in R 2 whose boundary consists 
of an inner circle C 2 of radius 1, oriented clockwise, and an outer circle C\ of radius 
3, oriented counterclockwise. If 


f F- Tds = 11 and f F-Tds 

Jc 1 Jc 2 


- 9 , 
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find the average of the function curlF over D. 

8 . 11 . Let g from R 2 to R and F from R 2 to R 2 be C 1 functions on a regular set D. 

(a) Show that div(gF) = gdivF + F • Vg. 

(b) Suppose g = 0 on dD. Use the Divergence Theorem to show that 


f (divF)gdA = - f F-VgdA. 
Jd Jd 


(c) Suppose g is C 2 , and 
show that 


take F = Vg. If g = 0 at all points of the boundary of D , 


X 


(zlg)gdA 



dA. 


8 . 12 . Let gi(v,y) = sinvsiny in the region D where 0 < x<n and 0 < y < n. 

(a) Verify that g\ is zero at all points of the boundary of D , and that Ag\ = -2gi. 

(b) Suppose that g is some C 2 function that is zero at all points of the boundary of 
D , and that Ag = 2 g. Use the results of Problem 8.11 to show this is not possible 
unless g is identically zero. 

8 . 13 . Suppose F(v,y) = (f(x,y),g(x,y)) is C 1 and let X(t) = (x(t),y(t)) be a smooth 
curve that satisfies the differential equation 


X' = F(X), 


that is, 

x'(t) = f(x(t),y(t)), y'(t ) = g(x(t), y(tj). 

We say that X is a periodic orbit of period p if p is the smallest positive number for 
which 

X(7 + p) = X(x). 

A periodic orbit is the boundary of a regular set. Use the Divergence Theorem to 
show that if divF > 0 then a curve X that satisfies X' = F(X) cannot have a periodic 
orbit. 

8 . 14 . Suppose P is a regular region in the plane whose boundary is a polygon. We 
list the n vertices 

(xuyi), (* 2 ,y 2 ), •••> (x n ,y n ) 

in order counterclockwise around the boundary. Justify the following steps to show 
that 

2 Area (P) = (-y\x 2 + x\y 2 ) + (-y 2 X 3 + x 2 y 3 ) + • • • + (-y n x i + x n yi). 

(a) I -y dx + x dy = 2 Area (P) 

JdP 

(b) If C denotes a straight segment from a point ( a , b) to point ( p , q) then 

J" -ydx + xdy = -bp + aq. 
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8.2 The Divergence Theorem in R 3 

The Divergence Theorem can be proved for vector valued functions with three com¬ 
ponents of three variables on a smoothly bounded set D in R 3 that is v and y and z 
simple. The idea of the proof is the same as for two variables. We apply the Funda¬ 
mental Theorem of Calculus in one of the variables and then integrate with respect 
to the other two variables. Let 


F (x,y,z) = (f(x,y,z),g(x,y,z),Kx,y,z)). 


Since D is v simple let D yz be the set of all points (y,z) for which some point 
(x,y,z) is in D. See Figure 8.13. We assume D yz is a smoothly bounded set in the y,z 
plane, and that 


a(y,z) < x<b(y,z) 

describes the interval of points (x,y,z) in D for which (y,z) is in D yz . 


z 



b 


y 


Fig. 8.13 A set D in R 3 , and D yz in the y,z plane. 


By the Fundamental Theorem of Calculus we have that 



Integrating this over D yz we get 



(8.7) 


f C f(b(y,z),y,z)-f(a(y,z),y,z )) dydz. 

JDyz 


At each point on the surface v = b(y , z) the cosine of the angle between the tangent 
plane to the surface and the y,z plane is N • i = n\, the v component of the outward 
pointing unit normal N. See Figure 8.14. 
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Fig. 8.15 The ratio of dy dz to dcr is n \. 


Therefore the relation between integrating with respect to dydz in the y,z plane 
and the corresponding surface area dcr above it on x = b(y , z) is 

n\ da = dydz 

so f(b(y , z), y, z)n\ dcr = f(b(y , z),y, z) dy dz. Similarly the cosine of the angle between 
the y 9 z plane and the plane tangent to the x = a(y,z) surface is -n\, where n\ is 
the x component of the outward pointing unit normal. As a result on the graph of 
x = a(y, z) we have 

n\ dcr = - dydz 

and f(a(y,z),y,z)n\ dcr = -/(a(y,z),y,z)dydz. See Figure 8.16. 


z 



Fig. 8.16 Tangent plane and unit normal N where x = a(y,z) and n\ is negative; n\ dcr = - dydz. 
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So summing over the graphs of v = b(y,z) and v = a(y,z ), 


(f(b(y,z),y,z)-f(a(y,z),y,z))dydz= f fmdcr. 



Combining this with (8.7) we have 



Now repeating the argument in the other two coordinates we have 




giving 



( 8 . 8 ) 


where N is the outward unit normal to the boundary of D. Just as in R 2 we can extend 
the result (8.8) to sets D that are regular , i.e., a finite union of smoothly bounded 
sets in R 3 that are x, y, and z simple and have only boundary points in common. This 
result is known as the Divergence Theorem. 

Theorem 8.3. The Divergence Theorem. Let F be a C 1 vector field on reg¬ 
ular set D in R 3 and let N be the unit normals to dD that point out of D. 

Then 



Example 8.10. Let F(v,y,z) = (x 9 y,z) and let D be the solid rectangular box 


2 < v < 4, 7 < y < 10, 1 < z < 5. 


Find the flux of F outward across the boundary of D. By the Divergence The¬ 
orem 




——h —— = 1 + 1 + 1 = 3, we have 
dy dz 



□ 
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Fig. 8.17 The cone in Example 8.11. 


Example 8.11. Let D be a solid cone with base B of area A and height h as 
shown in Figure 8.17. We use the vector field F(x,y,z) = (x,y,z) to show that 
the volume of the cone is \hA. Since divF = 3, the Divergence Theorem gives 

f F • Ndcr = f divFdL = 3Vol(D). 

JdD JD 

The boundary of D consists of the base B and sides S. The unit normal to B is 
k = (0,0,1), and the normals to S are perpendicular to F. Therefore 

f F• Nd<x = f F-kdcr+ f F-Ndcr= f hdcr + 0 = hA , 

JdD Jb Js Jb 

and Vol(D) = \hA. □ 


Example 8.12. Let F(v,y,z) = 


(x,y,z) 


zero: since 


d 


dx (x 2 + y 2 +z 2 ) 3 / 2 

g y 

dy (x 2 +y 2 +z 2 ) 3/2 
d z 


(x 2 +y 2 +z 2 ) 3 / 2 

(x 2 +y 2 + z 2 )- 3x 2 
(x 2 +y 2 +z 2 ) 5 / 2 

(x 2 +y 2 + z 2 )-3y 2 


. The divergence of this field is 


(x 2 +y 2 +z 2 ) 5 ^ 2 ! 

__ (x 2 +y 2 + z 2 )-3z 2 

dz (x 2 + y 2 + z 2 ) 3 / 2 (x 2 + y 2 + z 2 ) 5 / 2 


i . Jfi Jfi dfs i. ^ 
their sum —— + —- + - = divF 

ox oy oz 

the flux of F across various surfaces. 


0. In the next examples we will compute 

□ 


Find the flux of F outward 


Example 8.13. Let F(*,y,z) = , 0 0 

(v z +y z +z j r' 

across a sphere Sr of radius R centered at the origin. We cannot apply the 
Divergence Theorem since F is not defined at (0,0,0). So we compute the 
surface integral. At each point on the sphere, the direction of F is radial and 

normal to the surface, so F • N = ||F||. At each point on Sr, ||F|| = —. Therefore 
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Fig. 8.18 Left: A regular set D with the origin interior, Right: A regular set W that does not contain 
(0,0,0). See Example 8.14. 


(x,y,z) 


Find the flux of F outward 


Example 8.14. Let F(*,y,z) = , 0 0 

(x z +y z + z z r /z 

across the boundary of a regular set D that contains the origin in its interior. 
Because F is not defined at the origin, we cannot apply the Divergence Theo¬ 
rem directly to compute the flux of F across dD. Since (0,0,0) is in the interior 
of D , there is a small sphere Sr of radius R centered at (0,0,0) contained in 
the interior of D. See Figure 8.18. The region between Sr and dD is a regu¬ 
lar set that does not contain (0,0,0). Call it W. We can apply the Divergence 
Theorem to F on W and get 


f F Ndcr= f c 

Jdw Jw 


divFdV. 

In Example 8.12 we found divF = 0. Therefore J divFdV = 0. By the Diver- 

Jw 

gence Theorem the outward flux of F across 

dW = (0D)US R 

is also zero. Using the result from Example 8.13 we get 

0= f F• Ndcr = f F-Ndcr- f F-Ndcr= f F-Ndcr-47r. 

JdW JdD JS R JdD 

Therefore I F • N dcr = An. □ 

JdD 
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8.15. Let F(X) = X. Use the Divergence Theorem to evaluate 

f F-Ndcr 


'dD 


for each of the following sets D in R 3 . 

(a) D is the unit ball, ||X|| < 1. 

(b) D is a ball, ||X - A|| < r, A and r > 0 constant. 

8.16. Let D be a regular set in R 3 , let N be the outward pointing unit normals to dD , 
and let F be a C 2 vector field. Show that 



8.17. Use the Divergence Theorem to evaluate the integrals over the sphere S given 
by x 2 + y 2 + z 2 = 8 2 with unit normals N pointing away from the origin. 



(a) (x + 2 y, 3 y + 4 z, 5 z + 6x) • N d<x 


8.18. Use the Divergence Theorem to find the flux of F outward across the boundary 
of the rectangular box D = [a,b] x [ c,d] x [e,f]. 

(a) F(v,y,z) = (p + qx + rx 2 , 0,0), p, q, r constants 

(b) F(v,y,z) = (0,p + qx + rx 2 ,0), p, q, r constants 

(c) F = V/z, if h is a C 2 function with Ah = 0. 

(d) F(jc,y,z) = (e^,e>',e z ). 

8.19. Let F be a C 1 vector field on a regular set D in R 3 . Justify the following steps 
to show that 



Here N is the outward unit normal and the integral of a vector means to integrate 
each component of the vector. 


(a) (N x F) • C = N • (F x C) where C = (c \, C 2 , C 3 ) is a constant vector. 
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(b) 

(c) 

(d) 

(e) 

(f) 


f N-(FxC)dcr= r 
JdD JD 

[ N-(FxC)d cr= [ (curlF)-CdF 
JdD JD 


8 Divergence and Stokes’ Theorems and conservation laws 
div(FxC)dV. 


f< 

JdD 


(NxF)-Cdcr- f(curlF)• CdV = 0. 
Jd 


f NxFdcr- f curlF dVJ • C = 0. 

JdD Jd / 


JdD 

f NxFdcr- f 

JdD Jd 


curlF dV = 0. 


8.20. Let p be a C 1 function on a regular set D in R 3 . Justify the following steps to 
show that 


f WpdV = f ] 
Jd JdD 


Npdcr. 


Here the integral of a vector means to integrate each component of the vector. Let 
C = (ci,c 2 ,c 3 )bea constant vector. 


(a) 

(b) 0 

(c) 


f N • (pC)dcr = f div(/?C)dV = f (V^ • C + pdivC) dV = f Vp-CdV. 
JdD Jd Jd Jd 

= f Npdcr-C- f VpdV-C= f Npdcr- f VpdVLC. 

JdD Jd \JdD Jd ) 

I Npdcr = J VpdV. 

JdD Jd 



Fig. 8.19 The sum over all the faces, of face areas A times unit normal vectors N is the zero vector, 
in Problem 8.21. 


8.21. Suppose a polyhedron D has faces S\,...,Sk- Let N ; be the outward unit nor¬ 
mal vector of S',-. Justify the following items to prove that the sum of areas times 
unit normal vectors of the faces is zero: 

k 

^ Areal.S',) N, = 0. 

i= 1 


See Figure 8.19. 
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(a) Use the Divergence Theorem for the constant vector field F = (1,0,0) to show 
that 


/. 


n\ dcr = 0 , 


where N = (ni,n 2 ,ni) is the outer unit normal. 


(b) Show that I N dcr = 0, where the integral of a vector means to integrate each 
JdD 

component of the vector. 


(c) For each face of D , f N/ dcr = Area (S ON/. 

JSi 


8.22. Denote the inverse square vector field F(x,y,z) = 
pie 8.12 as 

F(X) = ||X||- 3 X, 


z) 

(x 2 +y 2 +z 2 ) 3 ' 2 


of Exam- 


and define a vector field G(X) = F(X - A), a translation of F by a constant A. 


(a) What is the domain of G? 

(b) Show that divG = 0. 

(c) Show that the flux 



G-Ndcr 


of G outward through the boundary of a regular region W is zero or 4n according 
to whether A is outside or inside W and A is not on dW. 


8.23. Let G(X) = cqF(X - Ai) + • • • + c n F(X - A n ) be a linear combination of inverse 
square vector fields, where F(X) = ||X||~ 3 X. 

(a) What is the domain of G? 

(b) Show that divG = 0. 

(c) Show that the flux of G out of each regular set W is 



•Ndcr = 


L 4nc ^ 

A k in W 


where the sum is over indices k for which A^ is in the interior of W and none of 
the Ak is on the boundary. 


8.24. Let X = (x\,X 2 ,xs) and let H(X) = (||X|| 3 X) JCi , the first x\ partial derivative 
of the inverse square vector field. See Figure 8.20. 

(a) Show that H = ||X|r 3 (l,0,0)-3xi||X|r 5 X. 

(b) What is the domain of H? 

(c) Show that divH = 0. 

(d) Use the Divergence Theorem to show that the flux of H out of every regular set 
D that does not contain the origin is zero. 
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Fig. 8.20 The field H in Problem 8.24, drawn in any plane containing the x\ axis. 


(e) Evaluate 



H-Ndcr 


directly when B is the region where ||X|| < 1. Explain why the Divergence Theo¬ 
rem does not apply here. 


8.25. Let u and v be C 2 on a regular set D in R 3 . Show that div ( vVu ) = vAu + Vv • Vm, 

and use the Divergence Theorem to derive 

(a) I (uAu + |Vw| 2 )dV = | uVu-Ndcr 

Jd JdD 

(b) | (vAu - uAv) dV = I (vVu - uVv) • N dcr. 

Jd JdD 

8.26. Let f(x,y,z ) = sin(v)sin(y)sin(z). 

(a) Show that Af = -3/ in the cube [0, 7 r ] 3 and that / = 0 on the boundary of the 
cube. 

(b) Suppose a function g has the properties Ag = 3g in the cube and g = 0 on the 
boundary. Use the result in Problem 8.25(a) to show that g is identically 0 in the 
cube. 


8.27. Let D be a regular set in R 3 . Suppose / is a C 2 function on D and A a number 
such that 

Af = Af 

in D and / = 0 on dD. Show that if A > 0 then / is zero in D. 


8.28. Define F(X) = (||X|| 2 - 2)X, where X = (x,y,z), and let D be the set ||X|| < r 
where r is a positive number. Lind r so that 


X 


divFdV = 0. 
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8.3 Stokes’ Theorem 

Green’s Theorem says that for a C 1 vector field F(v,y) = (fi(x,y), f 2 (x,y)) on a reg¬ 
ular region D in the plane we have 



Suppose we have a flat surface S that lies in a plane z = c in R 3 . Denote by S xy the 
“shadow” of S in the x,y plane, that is the set of points (x,y,0) such that (x,y,c) is 
in S. See Figure 8 . 21 . We see for G(x,y,z) = (gi(x,y,z),g 2 (x,y,z),g 3 (x,y,z)) 



Therefore 



(8.9) 



Fig. 8.21 Left: A flat surface S and its shadow. Right: A surface that is the union of polygons. 

Equation 8.9 also holds for surfaces in every plane, not just in planes parallel to 
the x,y plane. Hence if our surface is the union of finitely many plane polygon faces 
S i that meet along their edges then we can use the additivity of surface integrals and 
line integrals to add the formulas for the faces: 
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fs 


(curl G) • N dcr 


-zj s 

i= 1 JS * 

n r 

■S 
-I 


(curl G) • Ni dcr 


dcr 


G-Tds. 


See Figure 8.21. This suggests Stokes’ Theorem, that equation 8.9 holds for piece- 
wise smooth surfaces that are images of sets where Green’s Theorem holds. 

Theorem 8.4. Stokes’ Theorem. Let Gbe a vector field that is C 1 on a piece- 
wise smooth oriented surface S in R 3 whose boundary dS is a piecewise 
smooth curve, and that the domains of the parametrizations of S are regu¬ 
lar sets in R 2 . Then 

f curlG• Ndcr = f G-Tds, (8.10) 

JS JdS 

where the orientation of the unit normal vector N to S and of the unit tangent 
vector T to the boundary are chosen as in Figure 8.22. 



Fig. 8.22 A surface S oriented two ways, and the corresponding boundary orientations in Stokes’ 
Theorem. 


We give two proofs. The first proof approximates the surface by triangles and 
applies Green’s Theorem to those. The second proof uses Green’s Theorem on the 
preimages of S. 

Proof. (Approximation argument) Let X from D in the u, v plane to R 3 be a 
parametrization for one of the smooth surfaces that compose S. Since X m (m,v) and 
X v (m, v) are linearly independent, a triangular region T with vertices 

(m 0 , vo), (wo + /z,v 0 ), (uo,vo + h) 

is mapped to a “curved triangular” region X(T) on S with vertices 


X(mq,vo), X(uo + h,vo), X(uo,vo + h). 
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Denote by T' the triangle in R 3 with those vertices. See Figure 8.23. 



Fig. 8.23 Triangles T and V in the proof of Stokes’ Theorem. 


By the Mean Value Theorem 

X(u 0 + h, v 0 ) - X(«o, v 0 ) = X u (u, vo)h, 

X(uo,vo + h)- X(m 0 , v 0 ) = X v (m } , v)ft. 

The norm of the cross product of these vectors is twice the area of the triangle T'. 
The area of T is \h 2 . Since X u and X v are continuous functions, for every e > 0 we 
can choose h so small that for each point in T 

j> U (u, v) xX v (m, v) - x u (u\ , Vi) xX v Oi, vi)|| du dv < e(\h 2 ). 

Denote by X\(u,v) the linear approximation of X(u,v) through T'. Since the first 
derivatives of X(u , v) are continuous, for every e > 0, for h small enough 

||curlG(X(M,v))-curlG(Xi(M,v))|| < 6. 

Also since X\(u,v) is the linear approximation of X(u,v) through T', for h small 
enough 

||XO,v)-XiO,v)||<e 

for all (m, v) in the triangle T. We show that the difference between the flux of curlG 
across X(T) and the flux of curlG across T' is small: 

f (curlG) • Nd<r- f (curlG)-Ndcr 

JX(T) JT' 

= |J" ^(curl G(X(m, v))) • X M x X v - (curl G(Xi (m, v))) • Xi„ x X iv jdwdv . 
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\dudv 


d u dv 


By the triangle inequality this is 

< J ((curlG(X( m,v))- curlG(X i(m,v)))-X„xX v ) 

+ £ ((curl G(X|(m, v))) • (X B x X v - X lu x X lv )) 

< J ||curl G(X(u, v)) - curl G(Xi (u, v))|| ||X„ x X v || dw dv 

+ £ ||curlG(Xi(M,v))|| ||X„ xX y -X„(M,vo)xX v (M 0 ,v)||dwdv 
h 2 

< ke Area (T) -ke—. 

Area ( D ) 

for some constant k that depends on the maximum of ||curlG||. There are —— 

triangular regions in the region D. The error in using triangular regions to approxi¬ 
mate 

(curl G) • N dcr 


/ < 


is bounded by the sum of the errors in each triangle, 


Area(Z))/ h 2 \ 

—Y ) = Area ^^ 6 * 


By taking h small enough we can get the difference between the integrals less than 
6 : 


(curlG)-Ndcr,- f 

(curlG)-Ndcr 

| J U;X(T;) JL 



< 6 . 


( 8 . 11 ) 


Now in each flat triangle T ' we use Green’s Theorem, 


x. 


(curl G) • N d<x = 


G Td5 


to get 


r, 


■ r' 

JdT'i 

(curlG)• Ndcr- f G-Td^ = 0. 

JdT' 


( 8 . 12 ) 


We use an argument on the boundaries similar to the one we used for the triangular 
regions to show that for h small enough 


f G-Tds- f G-Tds 

Ju idT f . JUidX(Ti) 


< e. 


(8.13) 
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Similarly for h small enough 



(8.14) 


and 


and 


|Jas JuidXiTi) 

Adding relations (8.11)—(8.15) we get by triangle inequalities that 



(8.15) 


I f (curlG)• Ndcr- f G-Tds 



can be made as small as we like by taking h small enough. This proves (8.10) of 


Theorem 8.4. 


□ 



Fig. 8.24 Preparing to view Stokes’ Theorem as a curved version of Green’s Theorem. 

The second proof uses Green’s Theorem in a different way. See Figure 8.24. 

Proof. We consider first the case where the surface S is the range of a single smooth 
parametrization X from the u , v plane to R 3 . So S = X(£>), where D is a regular set 
in R 2 and X(dD) = dS. Then 



By the Chain Rule and some simplifying that we ask you to carry out in Prob¬ 
lem 8.38 we see that 


(curl G)(X(u, v)) • X M xX v = (G(X(k,v))) m -X v -(G(X(m,v))) v -X m . 
We define a vector field on D in R 2 by 


F(m, v) = (G(X(m, v)) • X u (u, v), G(X(m, v)) • X v (m, v)); 
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then 


curlF(w,v) = -^(G(X(m,v))-X v (m,v))- ^(G(X(m,v))-X„(m,v)) 

= G (X(«, v)) • X vu (u, v) + X v (m, v) • |-G(X(m, v)) 

ou 

- G(X(m, v)) • X hv (m, v) - X„(m, v) • 2-G(X(w, v)) 

<9v 

= (curl G)(X(m, v)) • X u x X v . 


So 


; dt/dv 


f (curl G) • N dcr = f (curlG)(X( m,v))-X„xX v 
J s Jd 

= J curl (G(X(m, v)) • X u (u, v), G(X(tr, v)) • X v (u, v)) du dv = j curlFdMdv. 


By Green’s Theorem 


I curlFdt/dv= 

Jd Jd£ 


F-Tds. 


JD JdD 

Now let R(0 = (w(0, v(0), a<t <b parametrize dD. Then X(R(0) parametrizes dS . 


f F-Tds= f 

JdD Ja 


F(R(r))-R'(r)dr 


(G(X(R(r))) • X„(R (f)), G(X(R(r))) • X v (R(f)),) • R'(r) dr 

(gl*« + #2;y« + gl + gUv + giZv) ■ R'O) d t 

(igix u + giyu + g3ZuW + (gix v + g 2 y v + g3ZvW) dr 

= I g l dx + g 2 dy + gi,dz= I G-Tds. 

JdS JdS 

This concludes the case where S has a single parametrization. 

Suppose now that S is a union of such surfaces that meet pairwise on common 
edges. We assume these are oriented so that the line integrals associated to adjoin¬ 
ing parts cancel on these common edges. See Figure 8.25. The edges that are not 
common to two parts constitute dS , so Stokes’ formula for S is obtained by adding 
the Stokes’ formulas for the parametrized parts of S. □ 

In Example 3.32 we saw that if F = V/ then curlF = 0. We now show that under 
an additional condition on the domain of F the converse holds. Suppose that every 
closed loop C in the domain of F is the boundary dS of a smooth surface S in D. 
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N 



Fig. 8.25 A surface parametrized in two parts, in the proof of Stokes’ Theorem. The common edge 
is drawn apart to show why the line integrals cancel there. 


By Stokes’ Theorem if curlF = 0 we have 



and by Theorem 7.1 there is a potential function g so that Vg = F. Domains that 
have the property that we need are called simply connected. 

Definition 8.2. We say that a set in R” is simply connected if it is connected 
and if every simple closed curve in the set can be shrunk continuously within 
the set to a point. 

We state without proof the following result: If a set is simply connected then 
every piecewise smooth simple closed curve in the set is the boundary of a piecewise 
smooth surface in the set. 

So we have the following theorem. 

Theorem 8.5. Suppose a vector field F is C l on an open simply connected set 
U in R 3 on which curlF = 0. Then there is a function g from U to R so that 


Vg = F. 


Example 8.15. The set D\ of points in R 3 with ||X|| A 0 is simply connected. 
The set D 2 of points in R 3 not on the z axis is not simply connected. 

If Fi is a C 1 vector field with curlFi = 0 on D\, then Fi has a potential 
function. 

If F 2 is a C 1 vector field with curlFi = 0 on £> 2 , then we cannot conclude 
anything about the existence of a potential function for F 2 . □ 
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Fig. 8.26 The hemisphere in Example 8.16. 


Example 8.16. Verify Stokes’ Theorem for F(x,y,z) = (. z 2 ,-2x,y 5 ) and S the 
upper hemisphere of radius 1 centered at the origin. See Figure 8.26. We com¬ 
pute curlF(v,y,z) = (5y 4 ,2z,-2). At each point (v,y,z) on S, the unit normal 
vector is N = (v,y,z). 


2) • (x,y,z) dcr 


f (curlF) -Ndcr = f (5y 4 ,2z,~ 

Js Js 

= I 5y 4 xdcr + I 2zydcr- | 2zdcr. 
Js Js Js 


By symmetry the first two integrals are zero. Using the parametrization of the 
upper hemisphere 


X(*,y): 


: (x,y, ^1 -x 2 -y 2 ), 0< x 2 +y 2 <1, ||X. t xX v 


^1 -x 2 -y 2 


with domain the disk D : 0 < x 2 + y 2 < 1 we get 


f(curlF) • Ndcr = - f 2 z dcr = -2 f 
Js Js Jd 


Vu 


: dxd y = -2 Area (D) = -2 n. 


To compute 


f - 


Vi -x 2 -y 2 

F • T ds we use X(t) = (cos t , sin t, 0), 0 < t < 2n to parametrize 


dS consistent with the upward pointing unit normals on S. We get 


x 


p2n 

F • T dx = (0, 

Jo 


r>Z7T 

-2 cos sin 5 1) • (-sinf,cos£,0)d£ = I -2 cos 2 tdt = -In. 

Jo 


□ 

Example 8.17. LetF (x,y,z) = (^2>^2’°)' 

(a) Find the circulation of F around a piecewise smooth simple closed curve 
Ci that does not encircle or intersect the z-axis. 

(b) Find the circulation of F around a piecewise smooth simple closed curve 
C 2 that encircles the z-axis once (but does not intersect it). 


See Figure 8.27. For (a) let S' be a piecewise smooth surface with dS = C\. 
Then since curlF = 0 we have by Stokes’ Theorem 
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Fig. 8.27 Left: The curves and Right: the surface in Example 8.17. 

f F • T ds = f curlF -Nd<x = 0. 

JCi=dS JS 

For (b), since F is not defined on the z axis, curlF is not defined there either. 
Every surface with boundary C 2 would include a point on the z axis. Stokes’ 
Theorem cannot be applied on such a surface. Consider a circle Cr of radius R 
centered on the z axis in a horizontal plane z = k that does not intersect C 2 . Let 
S' be a piecewise smooth oriented surface whose boundary is Cr U C 2 . Stokes’ 
Theorem applied to this surface gives 

f F • Tds + f F-Tds = f F-Tds = f curlF-Ndcr= f 0 dcr = 0 . 

Jc 2 JCr JdS=C R UC 2 Js Js 

Suppose C 2 encircles the z axis “counterclockwise” as in Figure 8.27. Then 
Cr goes clockwise. Parametrize Cr by X(t) = ( Rcost,-Rsint,k ), 0 < t < 2n. 
We get 


f F Td^+ f 

Jci JCr 


0)d t 


F-Td^+ F-Tds 

Jc 2 Jc R 

r „ r 2n /Rsint Rcost \ / 

‘Jc, dS+ Jo - Scos '- 

= f F• Td^ + f (-l)df= f F-Td^-2;r. 

Jc 2 Jo Jc 2 

The circulation of F around C 2 is 2n. Similarly if C 2 encircles the z axis once 
in the other direction the circulation is —2n. □ 

Consider two smooth surfaces S 1 and S 2 that have the same oriented boundary 

dS 1 = OS 2 

as illustrated in Figure 8.28, and a vector field F that is C 1 on both surfaces. Since 

F-Td j. 


r F-Tdr 

JdS 1 JdS 2 


Stokes’ Theorem implies that the fluxes of curlF across S 1 and across S '2 are equal, 
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where the normal vectors N are consistent with the boundary orientation. 

Example 8.18. Let F(x,y,z) = (z 2 ,-2x,y 5 ) and let S be the upper unit hemi¬ 
sphere as in Example 8.16. Let S \ be the unit disk x 2 +y 2 < 1 in the x,y plane, 
with unit normal vectors (0,0,1). Then 




□ 



Fig. 8.28 Two surfaces S i and S 2 that have the same boundary. 

Electromagnetism examples. Stokes’ Theorem is often used in the study of elec¬ 
tromagnetism. The electric field denoted E [force/charge] and magnetic field B 
[force/charge/velocity] are known to satisfy the system of Maxwell’s differential 
equations 


p 

// 060 E t = curlB -jUqJ, B t = -curlE, divE = —, divB = 0, 


where yUo,eo are constants, p [charge/vol] is the density of electric charge, and J 
[charge/time/area] is the current density. Several of the Problems explore relations 
among these to illustrate the use of vector calculus. 


Problems 

8.29. Use Stokes’ Theorem to evaluate the line integrals 
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(a) F(v,y,z) = (0,v,0) and C is the circle x 2 +y 2 = 1 in the plane z = 0, traversed in 
the counterclockwise direction as you look at the plane from the positive z axis. 

(b) F(x,y,z) = (y, 0,0) and C is the triangular path consisting of the three line seg¬ 
ments from A = (a, 0,0) to B = (0, b, 0) to C = (0,0, c) to A, where a , b , c are 
positive. 

8.30. Verify Stokes’ formula 



for the following cases by evaluating the integrals on both sides. 

(a) F(v,y,z) = (~y,x, 1), and S is the top face of the cube [0, l] 3 , with N toward +z. 

(b) F(v,y,z) = (~y,x, 1), and S consists of the other five faces of the cube [0, l] 3 , with 
N inward. 

8.31. Verify Stokes’ formula 



for the following cases by evaluating the integral on both sides. 

(a) F(v,y,z) = (-y,x,2), and S is the hemisphere x 2 +y 2 +z 2 = r 2 , z > 0, r constant, 
with N toward +z. 

(b) F(x,y, z) = (-y, x, 2), and S is the disk x 2 +y 2 < r 2 , z = 0, r constant, with N toward 
+z. 



S 


Fig. 8.29 The flow in Problem 8.32. 


8.32. A C 1 vector field V models the fluid velocity in a rotating storm as in Fig¬ 
ure 8.29. The surface S in the figure is a vertical cylinder open at top and bottom, 
where C\ and C 2 are boundary circles, and Vi and V 2 denote the values of V on C\ 
and C 2 , respectively. We suppose two properties hold: 

(a) curlV = (0,0 ,h(x,y,z)) is parallel to the axis of the cylinder, and 

(b) Vi and V 2 in the figure are tangent to C\ and C 2 , respectively, and have constant 


norms ||Vi||<||V 2 ||. 


Use Stokes’ Theorem to show that the properties are not consistent with each other. 
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8.33. Suppose S is a sphere and the vector field F is C 1 on S . Show that 

| curlF-Nd<x = 0. 


's 


8.34. Let S be the hemisphere v < 0 of the unit sphere centered at the origin in R 3 , 
oriented with normal pointing away from the origin. Evaluate 


curl ( x 3 z , x 3 y, y) • N dcr. 


8.35. A wire of radius R along the z axis (see Figure 8.30) carries a constant current 
density J = (0,0, j) and there is a magnetic field of the form 



(r<R) 
0 r>R ) 


where r = yj 'x 2 +y 2 and c \, C 2 and p are some constants. Outside the wire J = 0, and 


there is no dependence on the time t in this problem. 

(a) Use the Maxwell equation 0 = curlB -poJ inside the wire to find c \. 

(b) Find p so that 0 = curlB -poJ holds outside the wire. 

(c) Find C 2 so that B is continuous in R 3 . 

(d) Let D be a disk of radius R\ > R , center on the z axis, parallel to the x,y plane 
and with normal N aligned with J. Then curlB is not continuous on D. Show that 
Stokes’ formula 



holds. Be sure to explain the meaning of the surface integral of the discontinuous 
function. 

8.36. We say a vector field F is a vector potential of a vector field G if G = curlF. 
Show that if F is a vector potential of G then the flux of G across a surface S depends 
only on the values of F on the boundary of S . As a result, we say if G has a vector 
potential its flux is “independent of surface.” 

8.37. Ampere’s original law related magnetic field B along the boundary of each 
oriented surface S with the electric current J passing through S by 



po a constant. 

(a) Use Stokes’ formula to deduce 


curlB = poJ. 
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Fig. 8.30 Left: Current in a wire for Problem 8.35. Right: The displaced surface acquires displace¬ 
ment current in Problem 8.37. 


(b) Show that part (a) contradicts the conservation of charge law 

p t + divj = 0 , 

where p is the density of electric charge that can vary with time. 

Remark. Maxwell later replaced Ampere’s law curlB = poJ with 

curlB = yu 0 (J + epE*), 

calling 6 oE r the “displacement current.” This change allowed the unification of light 
with electromagnetism. See Figure 8.30 for a sketch of a surface displaced into the 
region of changing magnetic field in a capacitor. 

8.38. Show that if G is a continuously differentiable vector field in a set containing 
a smooth surface S , and S is parametrized by X(u , v) with domain D in the plane 
then 

((curl G)(X(u, v))) • (X u x X v ) = (G o X) u ■ X v - (G o X) v • X„. 

We used this formula in our second proof of Stokes’ Theorem. 

8.39. Use Stokes’ formula to deduce 

f f B • Ndcr = - f E-Tdi 
d* Js Jos 

from the Maxwell law B r = -curlE. See Figure 8.31. 

Remark. This relates to generation of electric power. 


8.40. Which of these sets are simply connected? 

(a) The set of all points (x,y,z) in R 3 that are not on the x axis. 

(b) The set of all points in R 2 other than (0,0). 

(c) The set of all points in R 3 in a solid torus (doughnut). 
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B 



Fig. 8.31 Changing magnetic flux is related to electric field in a loop, in Problem 8.39. 

8.4 Conservation laws 

We illustrate an application of multivariable calculus by discussing conservation 
laws. A conservation law expresses the fact that the rate at which the total amount of 
some substance (mass, momentum, energy) contained in a set D changes is equal to 
the rate at which that substance enters the set. We denote the density of the substance 
as s [substance/volume], and denote the rate at which it flows [substance/area/time] 
as F. Then the total amount of the substance in D at time t is 



The amount of substance that flows out of D per unit time is 



where the unit normal vectors N are outward. According to the conservation law the 
rate at which the amount of substance contained in D changes, 



is equal to the rate at which the substance flows inward through the boundary of D. 
So conservation of substance is expressed by the equation 



( 8 . 16 ) 


On the left we can carry out differentiation with respect to t under the integral sign, 
and on the right we use the Divergence Theorem to transform the integral over the 
boundary of D into an integral over D. We get 
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f s, dV = - f 
Jd Jd 


divF dV. 


Combining the two sides we get 


I (^v + divF) dV = 0. 
Jd 


Since this relation holds for all regular regions D it follows that the integrand must 
be zero (see Problem 6.17): 

^ + divF = 0. (8.17) 

This is the differential form of a conservation law. 


Conservation laws for fluid flow. We shall describe now the three basic conser¬ 
vation laws of mass, momentum, and energy of a fluid as instances of the general 
conservation law (8.17). 

Fluid dynamics, the study of the flow of fluids, is an extremely interesting and 
important branch of science, encompassing aspects of mathematics, physics, and 
engineering. It is basic for understanding the flight of airplanes. 

A. Conservation of mass. We denote the density [mass/volume] of a fluid by the 
Greek letter p. Flow velocity [distance/time] is denoted by V = (u,v,w). Therefore 
the rate at which material is transported is pV [mass/area/time]. Setting 

s = p, F = pV 

into the conservation law (8.17) gives the law of conservation of mass: 

p, + div(pV) = 0. (8.18) 

For an incompressible fluid the density p is a constant, independent of space and 
time; for such a fluid the equation of conservation of mass is then 

divV = 0. (8.19) 

B. Conservation of momentum. As above we denote the density of the fluid by p 
and velocity by V = (u,v,w). The pressure is P [force/area]. 

We will show that 

p(V; + V-VV) = -VP, 

accel. 

a version of Newton’s law of motion. 

The density of momentum in the v direction is the x component of pV, pu. We 
assume it is transported into a set D by two mechanisms: 

(a) It is carried by the substance flowing across dD into D. 

(b) It is imparted by the v component of the fluid pressure acting on dD. 


We describe now these mechanisms in detail. 
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a) The rate at which v momentum is carried into the domain D by the flow is 

puY • N dcr, 


7 ' 


where N = (n 1 ,^ 2 ,^ 3 ) is the outward unit normal. 

b) The rate at which v momentum is imparted to the fluid at the boundary is the 
pressure force on dD in the v direction, 


7 ' 


Pn\ dcr. 


We assume no forces act other than pressure forces. The total rate of change of x 
momentum is the sum 


7 < 


(puY - N + Pn\)dcr. 


Thus the law of conservation of x momentum in the form (8.16) is 

x f pudV=- f (pu\-N + Pn l )dcr = - f (puV + (P,0,0))-Ndcr. 
dt Jd JdD JdD 

In the differential form (8.17) it is 

(pu) t + div (puY + ( P\ 0,0)) = 0. 


We use the mass conservation equation (8.18) to express p t as -div(pV) in the equa¬ 
tion above; we get 

pu t - t/div (pY) + div (puY) + P X = 0. 

Using differentiation rules we have di xipuY) = Yu- (pY) + udi\(pY). Substitute and 
simplifying we get 

pu t + pYu • V + P x = 0 


or 


p(u t + uu x + vu y + wu z ) + P X = 0. 

Similar equations hold for the other two coordinates; we write all three as the vector 
equation 

( 

P 


Uf 

d 

u 

d 

u 

d 

u 


Px 


0 

Vt 

+ u — 

V 

+ v— 

V 

+ w— 

V 

+ 

Py 

— 

0 

dx 


dy 


dz 



y 


0 

w t 


w 

w 

w 

/ 




or 


p(Y t + Y • VV) + YP = 0. (8.20) 

We ask you in Problem 8.44 to identify the vector 


u t + uu x + vu y + wu z 
v t + uv x + vvy + wv z 

W t + UW X + VWy + ww z 


= V, + V-VV 
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as the acceleration X"(0 of the fluid at the point Thus equation (8.20) 

states that 

(density) (acceleration) + V(pressure) = 0, 
a version of Newton’s law of motion. 


C. Conservation of energy. The total energy in a fluid contained in a set D is 
the sum of its kinetic and internal energy. Internal energy density, denoted as e , is 
defined as internal energy per unit volume, and depends on the density of the fluid p 
and fluid pressure P. We assume energy is imparted to a set D by two mechanisms: 

(a) Energy carried by the substance flowing across the boundary into D , 

(b) The work done by the fluid pressure on the boundary of D. 

The rate at which energy, both internal and kinetic, is flowing out of D is 



jpV • V)V -Nd<x. 


( 8 . 21 ) 


where N is the outward normal. Therefore the rate at which energy is flowing into 
D is the negative of (8.21). 

The rate at which the fluid pressure does work on the boundary of D is 




PY • N dcr. 


Therefore the total rate at which energy is imparted to D is 



ipV-V + P)V-Ndcr; 


the function e + • V + P is called enthalpy. 

The total energy E(D) of fluid contained in D is the sum of the internal energy 
and kinetic energy of fluid contained in D : 

E(D)= f (p+ipV-V) dV. 

Jd 

Therefore the conservation law in the form (8.16) is 


d_ 
d t 


f (e + jpV • V) dV = — f (e + 
Jd JdD 


\p\ ■ V + P)\ ■ N dcr. 


In the differential form (8.17) it is 

e, + ip, V • V+pV • V, = -div ((e + ipV • V + P) v). 
Rearrange this equation as 


e, + div (e\) = - \p ,V • V - pV • V, - div ((ipV • V + P) v). 
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We show now that the right hand side simplifies to -PdivV. By the differentiation 
rule div (/W) = /div W + W • V/ we get 

div ((ipV • V + / J )V) = ±div (pV)V • V + ±pV • V(V • V) + div (P\). 

Therefore 


e t + div (eV) = -±p f V • V -pV • V, - ±div(pV)V • V - ±pV • V(V • V) - div (PV). 

Using the mass conservation law p t + div(pV) = 0 the first and third terms cancel on 
the right side, giving 

e t + div (eV) = -pV • V, - ±pV • V(V • V) - div (PV). 

Using again div(/W) = /div W + W • V/ the right side is 

= -pV • Vf - ipV • V(V • V) - V • VP - PdivV. 

In Problem 8.43 we ask you to verify that V • V(V • V) = 2V • (V • VV). Therefore the 
last expression becomes 


= -V • (p\ t +pV • VV + VP) - Pdiv V, 
and by the momentum conservation law (8.20) this is 

= -PdivY. 


So the energy equation is 


e t + div(^V) = -Pdiv V. (8.22) 

The energy equation has to be supplemented by an equation of state, that specifies 
the internal energy e as a function of density and pressure. The three conservation 
laws (8.18), (8.20), and (8.22) supplemented by an equation of state are the equa¬ 
tions governing the flow of fluids when the only force is that due to the gradient of 
pressure. 


Problems 


8 . 41 . The integral form of the mass conservation law is 


d 
d t 




•Ndcr. 


Take the case where V = ( u , 0,0) is aligned with the v axis, p and V depend on v 
only, and the set D is a cylinder of cross section area A, with axis a < x <b. Show 
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that the mass conservation implies 

0 = (p(b)u(b) - p(a)u(a)}A. 

8 . 42 . Show for all differentiable functions p and V that 

p t + div (pV) = p t + V • Vp + pdiv V. 

8 . 43 . Show for all differentiable functions u , v, and w that 

uu x + vu y + wu z 
uv x + vv y + wv z 
uw x + vw y + ww z 

to verify the formula V • V(V • V) = 2V • (V • VV) that we used in the energy equation. 

8 . 44 . Let X(t) be the path followed by one particle moving with the fluid. This 
means that the velocity agrees with that of the fluid at each point of the path: 

X'(0 = V(X(0,0- 


u(u 2 + v 2 + w 2 ) x + v(u 2 + V 2 + W 2 )y + w(u 2 + v 2 + w 2 ) z = 2 (u, v, w) • 


Use the Chain Rule to show that 


v,+v-vv 


u t + uu x + vu y + wu z 

V t + UV X + VVy + wv z 
w t + uw x + + ww z 


is the acceleration of the particle, that is, 


X"(0 = V,(X(0,0 + V(X(0,0 • VV(X(0,0- 


Remark: Note that V • VV = (DV)V where D\ is the matrix derivative of u, v, w with 
respect to x,y,z. 

8 . 45 . Take the fluid velocity to be V(X) = c||X|| _3 X where c is a constant. 

(a) Show that div V = 0. 

(b) Show that the acceleration (See Problem 8.44) is -2c 2 ||X|| -6 X. 

(c) This is a model for flow in a conical duct. For each of the three cases indicated 
in Figure 8.32, indicate the direction of acceleration. 

Remark: Maxwell used this flow as an analogy for a static electric field. 


8 . 46 . Use either direct calculations or the Divergence Theorem to verify that 


X 


WPdV = 



NPdcr 


for the functions in (a) and (b), where D is the ball of radius r given by ||X|| 2 < r 2 . 
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c > 0 c < 0 

Fig. 8.32 Flows using V in Problem 8.45. 


(a) P(X ) = a||X|| 2 , a > 0 constant 

(b) P(X) = B • X, B ^ 0 constant 

(c) Using the notion of pressure, 



NPdcr = pressure force on D , 


which of (a) or (b) gives a nonzero force on the ball? 


8 . 47 . Let the fluid velocity be 


V(X,0~X. 


(a) Describe the fluid velocity at times t- 0 and at t - 1. 

(b) Show by computing the divergence div V that this is a compressible flow. 

(c) Find a number a so that the functions 

p(x,t) = (i+t) a \\x\\ 2 , v(x,o = jL x 

satisfy the conservation of mass equation 


pt + div (pV) = 0. 


8 . 48 . Take the case where the flow only depends on v and t , and the velocity is 
parallel to the v-axis, V = (u, 0,0). Show that the mass, momentum, and energy 
equations become 


Pt + (pu)x = 0 

Px 

U t + UU X = - 

P 

e t + ( eu) x = -Pu x . 


Remark. Problems 8.49-8.51 introduce sound waves. The wave equation is also 
discussed in Chapter 9. 

8 . 49 . For an ideal gas the pressure and energy are related to the density by 


e = 



P = kp y , 
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where k , c, R , and y are constants. Show that for flows of an ideal gas the energy 
equation (8.22) is implied by the mass equation (8.18) if we set 

t R 

y — 1 H—. 

c 

8 . 50 . Use the result of Problems 8.48 and 8.49 to derive the equations 

p t + (pu) x = 0 
u t + uu x = -kyp y ~ 2 p x 

for flows of an ideal gas. 

8 . 51 . In the equations of Problem 8.50, consider the case where the velocity u is 
very small, and the density p is close to a constant, say 

u = ef(x, t), p = po + eg(x , t), 

e small. Show that 

gt+Pofx = 0 

ft + (kypjp)gx = 0 

approximately, ignoring powers of e. Deduce that g satisfies the wave equation 

gtt = (kypo)gxx- 

8 . 52 . If in addition to the effects we’ve considered, there is a gravitational accelera¬ 
tion (0, -g, 0) acting on the fluid, the conservation law in the y direction is 

^ J* pv dV = - J* (pvV + (0, P, 0)) • N dcr + J* -pg dV. 

Show that the resulting differential equation becomes 

p(v t + uv x + vv y + wv z +g) + P y = 0. 


8.5 Conservation laws and one-dimensional flows 

One way of stating the Fundamental Theorem of Calculus is this: 

f(x)dx = f(b)-f(a), fisC 1 . 

The higher-dimensional analogue of this result is the Divergence Theorem, which 
states that 
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F is C, N is outer. (8.23) 


] D JdD 


As we saw in the last section, the right side of (8.23) has an interesting interpretation 
when F is the rate of flow of some physical quantity like mass, momentum, or energy 
(stuff per area per time). The dot product F • N is the rate of flow in the direction N. 
Therefore the integral on the right side of (8.23) is the rate at which stuff is flowing 
out across the boundary of D. 

The one-dimensional situation is even simpler. There the rate of flow fix) is a 
scalar quantity (stuff per time), denoting the rate at which stuff is flowing in the 
positive x direction. The quantity fib) is then the rate at which stuff is flowing out 
of the interval [< a , b] at its right endpoint, and f(a ) is the rate at which stuff is flowing 
into the interval [a, b] at its left endpoint. So fib) - f(a) is the rate at which stuff is 
flowing out of the interval [a,b]. 

There is another way of calculating the rate at which stuff contained in an interval 
is changing. Denote by p the density (stuff per length) of the stuff under considera¬ 
tion (mass, momentum, energy). The total amount of stuff contained in an interval 
[a, b] is the integral of the density 





If density depends on time t , as well as on position x, so will the total amount of 
stuff contained in [a,b], 



The rate of change of the total amount contained in [a, b] is the time derivative 



Assume now that no chemical reactions take place, therefore stuff is not created 
nor destroyed. Then the only way the total amount of stuff in [a, b] can change is 
through stuff entering or leaving across the boundary of [a,b]. According to the 
previous discussion, the rate at which stuff is leaving [a, b] through its boundary is 
f(b, t) - fia, t). The rate at which the amount contained in the interval [a, b] changes 
is the negative of the rate at which stuff leaves through the boundary, 



(8.24) 


If we assume that p and / are continuously differentiable functions of t and x, we can 
carry out the differentiation with respect to t on the left of (8.24) under the integral 
sign, and rewrite the right side using the Fundamental Theorem of Calculus 


8.5 Conservation laws and one-dimensional flows 


377 


We can rewrite this as 




dx. 


+ fx)dx = 0. 


This relation holds for every interval [a, b] ; from this we conclude that 


(8.25) 


(8.26) 


Pt + fx = 0 (8.27) 

for all values of x and t. For if there were a location xo and time to where (8.27) 
is violated, say (pt + f x )(x oJo) is positive, then, since the partial derivatives p t and 
f x are assumed to be continuous, it would follow that p t + f x is positive in a small 
enough interval [xo - 6,xo + e] in [a,b]. But then the integral 

r*o+£ 

I (pt + fx) dx 

Jxo~6 

would be positive, contrary to (8.27). 

As we saw in the last section, the same analysis was carried out in three dimen¬ 
sions to obtain the law of conservation of mass expressed as the differential equation 

pt + div (pV) = 0. 

Deriving differential equations that the rates of flow must satisfy is only the first 
step in studying flows. The major task is to find solutions of these equations. This 
will tell us how flows behave. As an example, we present, and solve, a simplified 
model of the flow equations in one space dimension. The simplification is that we 
assume that the rate of flow [stuff/time] is a function of density alone, / = f(p). 
Then equation (8.27) for the flow becomes 


Pt + f(p)x = 0. (8.28) 

For the sake of simplicity we also assume that / is a quadratic function of density, 
f(p) = \p 2 - Then the equation for the conservation law is 

Pt+PPx = 0. (8.29) 

If the values of density p at time t- 0 are known, 

p(x, 0) =p 0 (*), 

we shall show how to use equation (8.29) to obtain the values of the density p for 
future times. 

Consider functions x that satisfy the differential equation 


— = p(x, t). 
d t 


(8.30) 


378 


8 Divergence and Stokes’ Theorems and conservation laws 


The graph of x is a curve on which we examine the values of p. Denote the starting 
point of a curve by xo: 

*(0) = *o. 

Next compute the derivative of p(x(t),t) along such a curve. Using the Chain Rule 
we get 

d dx 

—p(x(t),t)=p x —+p t . (8.31) 

According to (8.30), 'jj = P- Setting this into (8.31) gives 

■j~pC*V)> 0 = PxP +Pt- (8.32) 

But according to the differential equation for the conservation law (8.29) satisfied 

d 

by p, the right side of (8.32) is zero! This shows that —p(x(t),t) = 0, which is the 

case only if the density at x(t) at time t, p(x(t),t ), is independent of t. 

It follows that the right side of equation (8.30) is constant and therefore the graph 

dx 

of v is a straight line! The speed — with which this line propagates can be deter¬ 
mined at t = 0: p(x(t)x,t) = p(x o,0), the initial value of p at the point vo- Denote 
p(vo,0) as po; the solution of equation (8.30) is 

x{t) = xq +pot (8.33) 


and the solution of (8.29) satisfies 

P(*0+Po0 =Po- (8.34) 

The geometric interpretation of formula (8.33) is this: from each point vo of the line 
where t - 0 draw a ray propagating with speed po in the positive t direction. Set 
p(x , t) = po on this ray. Using the Inverse Function Theorem we can show that if the 
initial value for the density po(x) is a smooth function of x, then for t sufficiently 
small p(x,t), as defined here, is a smooth function of x and t, and is a solution of 
equation (8.29). We guide you through this argument in Problem 8.62. 

The interesting question is: what happens when t is no longer sufficiently small? 
Suppose that for two values x\ < X 2 , Po(*i) is greater than po(*2)- 

Then the rays given by equation (8.33) intersect at some critical positive value of t. 
At this point of intersection (x, t ), the density p(x, t ) is defined to be both equal to 
Pi = PoCu) and p 2 = Po(*2X a contradiction. This shows that no solution of equation 
(8.29) with such prescribed initial values po(x) exists for t greater than this critical 
value of t. 

The following example points a way of resolving this problem. Take for the initial 
values po the continuous function 
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x (t) = x o + Po t 


t 


1 


'LL 



X 


X 


0 


Fig. 8.33 The rays in the x, t plane on which p(x, t) is constant, have a slope that depends on that 
constant. Note the time evolves upward. 



Fig. 8.34 Two rays may intersect at a certain time. 


1 if jc < — 1, 

Po(x) = < -x if - 1 < x < 0, 
0 if 0 < x. 


(8.35) 


For this choice of po the rays are as follows. 

(a) For xo < -1, x(t) = xo + t. 

(b) For -1 < xo < 0, x(t) = xo - xo t. 

(c) For 0 < x, x(t) = xo. 

For t < 1 these lines look as in Figure 8.35. 



-1 0 


Fig. 8.35 The rays of constant p(x, t ), for the initial value in equation 8.35. 

The rays don’t intersect for t < 1, but as t tends to l, all rays issuing from points 
of the interval -1 < x < 0 intersect at x = 0. So the value of p(x, t) at t = 1 is 



for x < 0 
for 0 < x, 


(8.36) 
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a discontinuous function. 

The solution p of p t +pp x = 0 for this example is indicated in Figure 8.36, for 
0 < t < 1. It satisfies the initial values (8.35). We ask you in Problem 8.57 to verify 
the values of p indicated in the figure. 



Fig. 8.36 Solution p of (8.29) with initial values (8.35). 


We show now how to continue such a solution p of 


Pt + fx - 0 


as a discontinuous solution. This sounds like nonsense, for a discontinuous func¬ 
tion is not differentiable at the points of discontinuity, therefore cannot satisfy the 
differential equation at such points. To give meaning to the concept of a discon¬ 
tinuous solution we have to go back to the integral version of the conservation law, 
equation (8.24), from which the differential version (8.28) was derived. Whereas the 
differential version makes no sense for discontinuous functions, the integral version 
does! 

Suppose p is a function that has a discontinuity across a smooth curve in the x, t 
plane. On each disk that does not intersect the curve, p is continuous. Describe this 
curve of discontinuity as x = y{t) in Figure 8.37. 



Fig. 8.37 A discontinuity of p(x, t) can occur across a curve x = y(t). 


Choose the interval [ a , b] so that the discontinuity y{t) lies between a and b during 
the time interval we are investigating. Since p is continuous on each side of the graph 
of x = y{t) we can write the total amount of stuff in [< a , b] as 
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pdx. 


We differentiate the above integral with respect to t , using the rule (see Prob¬ 
lem 8.53) 


d ryiit) ryi(t) 

— g(x,t)dt = I g t (x, t) At + giyoU), t)y' 2 (t) - g(yi (/), t)y\ (t) 
at Jyi(» 


for differentiating an integral whose integrand and limits both depend on t. We get 


d_ 
d t 


JO-SX 

■£ 


d r y(t) , d 

pdx + — 
d t 


p t dx+p(L)y t + 



(8.37) 


Here p t and y t denote derivatives with respect to t , and p(L), p(R) denote the limiting 
value of p on the left and right sides of the discontinuity. 

We apply the integral conservation law (8.24) to an interval [ a,z] with z < y(t). 
Since p is differentiable on this interval, 


d_ 
d t 


[pAx = r 

Ja Ja 


p t dx = f (a)-f(z) 


Now let z tend to y(t). Then f(z) tends to /(L), the value of / on the left side of the 
discontinuity. So we get 


Similarly 


r 

r 


p,dx = f (a)-f(L) 


p t dx = f(R)-f(b) 


where f(R ) is the value of / on the right side of the discontinuity. Setting these 
relations into the right side of (8.37) yields 


d r f 
Ja 


pdx = f(a) - f(L ) +p{L)y t + f(R) - f(b)-p(R)y t . 


(8.38) 


d r 

According to the integral conservation law, — 

& J a 

elude from (8.38) that 


pdx = f(a ) - f(b). So we con- 


m - m + (p(L) -p(R))y t = o (8.39) 


It is convenient to denote the jump of p and / across the discontinuity by brackets: 
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p(R)-p(L) = {p], 

[/] 


So we can rewrite (8.39) as y t 


lp] 


. The derivative y t of y is the speed with which 


the discontinuity is propagating. Let’s denote it by s = y t . Then 

, = *Q 

l p ] 


(8.40) 



-1 0 


Fig. 8.38 A shock wave at speed \ continues the solution shown in Figures 8.35 and 8.36. 


We return now to our example p t + pp x - 0, where f(p) = \p 2 and p(x , 1) is given 
by the discontinuous, 


p(*A) = 


x < 0 
v > 0. 


The solution p(x , t) for t > 1 consists of two regions separated by a discontinuity 
issuing from the point (0,1); see Figure 8.38. To the left of the discontinuity p(x , t) = 
1, and to the right of the discontinuity p(x,t) = 0. The discontinuity is a straight 
line with speed s = j^j. This moving discontinuity is called a shock wave. Here 
[p] = 0- 1 = -1 and for / = ±p 2 , [/] = 0- \ = -±, so by (8.40) 


s = 


[ft 

lp] 


_ l 

2 

-1 


_ 1 
“ 2* 


Similar calculations of discontinuous solutions of conservation laws for functions 
of two or three variables yield the analogue of formula (8.40), where s is the speed 
of propagation of the discontinuity in the direction normal to the discontinuity. 


Problems 


8.53. Apply the Fundamental Theorem of Calculus, and the Chain Rule in the form 
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to the function g(r , s, t) 


d 
d t 


g(a(t),b(t),t) = Vg-(a',b', 1), 



, t) dx to show that 


a rb(t) rb(t) 

- /(x,0dx = /(K0,^'(0-/W0^V(0+ Mx,t)dx, 

at Ja{t ) Ja{t) 

for C 1 functions /. 


x 

8.54. Show that p(x , 0 = —- is constant along certain rays x = xo + mt and give a 

t + 8 

formula relating m and xo. 

8.55. Show that the function p(x, 0 of Problem 8.54 is a solution of p t + pp x = 0. 


8.56. Verify that the conservation law 


d r b 

— J p(x, t) dx = -fib, t) + f(a , t) 


holds for the function p(x,t) of Problem 8.54, the flux function / = ^p 2 , and the 
interval [a,b] = [2,5]. 


8.57. For 0 < t < 1 let 


T 1, x < t- 1 
p(v,0 = < t~t, t-l <x<0 
{ 0, 0<v 


as illustrated in Figure 8.36. 

(a) Show that p is continuous for 0 < t < 1 . 

(b) Sketch the graphs of p(x, 0) and of p(x , 1). 

(c) Show that p t + pp x = 0 for 0 < t < 1 except along the segments x-t- 1 and x = 0. 

(d) Equation (8.34) applied to this function says that 


p(x 0 + i~xo)t,t) = -xq 9 (-1 < XQ < 0). 


Use this to derive the -—- part of the formula for p. 


8.58. A solution of p t + pp x = 0 has initial value 


T 10, x<0 
p(x,0) = <10-10x, 0<x<l 
( 0, l<x 


At what time t does a shock wave form? 
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8.59. Suppose instead of the conservation law (8.24) there is some mechanism to 
generate stuff within the interval at rate g : 


d_ 
d t 



,t) dx = - f(b , t) + f(a, t) + 



If this holds for all intervals [a, b] and the functions are all continuously differen¬ 
tiable, show that the differential equation form of the conservation law becomes 


Pt + fx = g- 

8.60. For the equation p t + (\p 2 ) x = 0, show that the jump speed formula s = j£j 
gives the speed as 

s=\(p(L)+p(R)), 

the average of the left and right limits at the discontinuity. 



8.61. Use the result of Problem 8.60 to find all the shock speeds indicated in Fig¬ 
ure 8.39. The initial value p(v,0) is a piecewise constant function whose values are 
shown along the x axis. 

8.62. The solution of p t +pp x = 0 with differentiable initial value p(x, 0) = po(x) 
requires that p be constant on rays x = xo +pot. That means we need to find a function 
p(x, t ) that satisfies 

p(x+po(x)t,t) =po(x). 

Define a function F (x,t) = (x+po(x)t,f), so that we need poF(x,t) = po(x). Justify 
the following steps using F to prove the existence of such a function p. 

(a) Find the matrix derivative DF. 

(b) Show that DF(x,t) is invertible when t is small. 

(c) Apply the Inverse Function Theorem to conclude that F -1 is locally defined and 
differentiable. 
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(d) The formula p = po o (the first component of F 1 ) defines a differentiable function 
that solves p t +pp x = 0 for small t and has p(x,0) = po(x). 

8.63. Suppose we draw two rays x = xo + pot, one starting from xo = 0 where we 
assume po = 2, and the second starting from xo = 3 where po = 1.5. Find the critical 
value of t and the value of x at which these two rays intersect. 



p =o 


X 


0 


Fig. 8.40 Values p(x, t) of the solution in Problem 8.64 are indicated in the x, t plane. 

8.64. Consider the equation p t + ( \p 2 ) x = 0 with initial data p(x,0) = 2 if x < 0, 

p(x,0) = 1 if 0 < x < 1, and p(x,0) = 0 if x > 1. See Figure 8.40. 

(a) Show that the speeds of the shock waves starting from x = 0 and x = 1 are 3/2 
and 1/2. 

(b) Find the point J where the shock waves meet. 

(c) Find the speed of the single shock that continues the solution beyond the time at 
point J. 

(d) If you could stand at the point x = 2 at time 0 and wait, would you observe one, 
or two, shock waves passing by? 

8.65. Suppose p t + xp x - 0. Sketch curves in the ( x,t ) plane on which x'(t ) = x(t). 

Show that on those curves, 



so that p is constant on each such curve. Find p(x, t) if p(x, 0) = x 2 . 




Chapter 9 

Partial differential equations 


Abstract In this chapter we derive the laws governing the vibration of a stretched 
string and a stretched membrane, and the equations governing the propagation of 
heat. Like the laws of conservation of mass, momentum and energy studied in the 
previous chapter, and like the electromagnetism laws, these laws are expressed as 
partial differential equations. We derive some properties and some solutions of these 
equations. We also state the Schrodinger equation of quantum mechanics, derive a 
property of the solutions and explain the physical meaning of this property. 


9.1 Vibration of a string 


Imagine a string stretched along the x-axis. When we pluck the string each point of 
the string vibrates in a direction perpendicular to the stretched string, call it the u 
direction. See Figure 9.1 . Assume the vibration is in the x, u plane. The displacement 
u of each point x of the string is a function of x and t. We use Newton’s law, force 
equals mass times acceleration, to derive a partial differential equation satisfied by 
w(x, t). 



Fig. 9.1 A string displacement is shown at one value of the time. The displacement and slope of 
the string are assumed very small everywhere and are exaggerated here for visibility. 


Let T be the magnitude of the tension force in the elastic string, that we take to 
be the same at all points of the string and at all times. Consider small vibrations of 
the string, where the slope of the vibrating string differs little from the direction of 
the undisturbed string. 
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Take a small piece of the string between v and x + h, h positive. There are two 
forces acting on this piece of string, the tension at each end of the piece. Let 6(x) be 
the angle between the tangent to the string at the position x, and the v axis, 


.. . du 

tan 0(x)= 

OX 


(9.1) 


See Figure 9.2. 



Fig. 9.2 Tension forces depend on angles 0(x), 0(x+h) at the ends of a bit of the string. The angles 
are drawn too large in order to make them more visible. 


The forces in the (jc, u) plane acting on the piece of the string at x and x + h are 

- T (cos 6(x) , sin 0(x)), T (cos 6(x + h) , sin 6{x + h)). 

We consider the motion of the string only in the u direction. 

The force on the small piece in the u direction at x + h is T sin#(v + h). The force 
at v is -T sin^(v). The total force is the sum of these two forces: 

total force = T sin0(v + h)-T sin #(.*). 

We use the Chain Rule and the Mean Value Theorem to express the right side as 

d6 

total force = hT cos(0) —, (9.2) 

dx 


where 0 and — are evaluated at some point between x and x + h. Since 0 is small, 
dx 

we approximate cos 0 as 1, and tan# as 6. Since at each point the slope of the line 
tangent to the string is the tangent of 6 we get 


— = tan(0(x)) 
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du 

Replacing 0 by — and cos 6 by 1 in formula (9.2) we get 
ox 


d 2 u 

total force = hT —-. 

dx 1 


(9.3) 


The mass of the piece of string of length h is hW, where W is the mass per unit 
length of the string. The acceleration of the piece of string in the u direction is u tt . 
We apply the law 

total force = mass times acceleration 


to the motion of the piece of string. Using formula (9.3) for the total force acting on 
the piece of string, and hW for its mass, we get 


d 2 u 

hT —- = hWutt. 
dx 2 


Dividing by hW gives 

W = u tt- (9.4) 

Since T and W are both positive, so is ^. We write c = and set it into equation 
(9.4); we get 

u tt -c 2 u xx = 0. (9.5) 

Equation (9.5) is called the one-dimensional wave equation. 

It follows from equation (9.5) that c has the dimension of velocity; but velocity 
of what? We shall show that ±c are the velocities with which certain waves in the 
string propagate along the x direction. 

Example 9.1. Take u(x,t) = cos (x-t). Then u is a solution of the wave equa¬ 
tion with c 2 -l because 


Utt -Uxx = - cos(* - 1) - (- cos(* - 1)) = 0. 

This is a wave that propagates to the right at speed 1. See Figure 9.3. In Prob¬ 
lem 9.5 we ask you to identify the speed and direction that other waves move. 
□ 


cos(x) cos(x-l) cos(x-2) 



Fig. 9.3 The wave in Example 9.1 plotted at times t- 0, 1,2. 
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Example 9.2. The function 

u(x , t) = cos(v - ct) + cos(v + ct) 

is a sum of a wave moving to the right and a wave moving to the left, both at 
speed c. By the addition formula for the cosine function, 

u(x , t) = cosxcos(ct) + smxsm(ct) + cosxcos(ct)-smxsin(ct) = 2cosxcos(ct). 

This is a wave that oscillates up and down. See Figure 9.4. □ 



Fig. 9.4 The sum of waves in Example 9.2 is illustrated at several times. 


It follows from the rules of calculus that every function of the form 
u(x , t) = f{x - ct) + g(x + ct) 

where / and g are twice differentiable functions of a single variable, is a solution of 
u tt - c 2 u xx = 0. The theorem below shows that every solution is of this form. 

We introduce the following notation: D denotes the trapezoid 

a + ct < x<b-ct, t\ <t<t 2 (9.6) 

in the ( x,t) plane. We number its sides as C\,C 2 ,Cs and C 4 . See Figure 9.5. 



Fig. 9.5 The trapezoid defined in (9.6). 
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Theorem 9.1. Every solution u of the wave equation 

Utt ~ c u xx = 0 

in the trapezoid D where a + ct < x<b-ct, t\<t <t 2 is of the form 

u(x,t) = f(x-ct) + g(x + ct), (9.7) 

where f and g are twice differentiable functions. 


We derive Theorem 9.1 from Theorem 9.2 below. First we introduce the notion 
of initial data. Denote by u(x , t) a solution of the wave equation in the trapezoid. The 
pair of functions 


u(x,t\), u t (x.t\), a + ct\ < x <b-ct\ (9.8) 

are called the initial data of u at time t\. 


Theorem 9.2. Let u be a solution of the wave equation 

u tt - c 2 u xx = 0 

in the trapezoid D where a + ct < x < b - ct, t\ < t < Suppose the initial 
data ofu at time t\ are zero. Then u(x,t) is zero in the whole trapezoid. 


Before giving the proof of Theorem 9.2 we show how it implies Theorem 9.1. 

Proof, (of Theorem 9.1) Let u be a solution of the wave equation in D. We show 
first that there is a solution v of the wave equation of the form (9.7) whose initial 
data are the same as the initial data of u. We construct two functions / and g that 
satisfy the relations 

u(x,t i) = f(x-cti) + g(v + ct\), u t (x,t\) = c(g(x + ct\) - f(x-ct\)) (9.9) 

for a + ct\ < x < b- ct\. Differentiate the first equation with respect to x and add 
~ c times the second equation to get u x + ~ c u t - 2 g', or subtract £ times the second 
equation to get u x -^u t = 2/'. Therefore 


g'(x + ch)=\u x (x,t\)+±- c u t (x,h), f'(x-ct\)= \u x (x,h)~ ±u t (x,ti), (9.10) 

from which / and g can be determined by integration. These formulas show that, 
since u is twice differentiable, so are / and g. 

Define the function v as 


v(x.t) = f(x - ct) + g(x + ct). 


( 9 . 11 ) 
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Then v is a solution of the wave equation in D. It follows from (9.9) that the initial 
data of v and u at time t\ are equal: 


v(x,t\) = u(x,t\), v t (x,t\) = u t (x,t\). (9.12) 

Next define w as the difference of u and v, 


w(x , t) = u(x , t) - v(v, t). 

Being the difference of two solutions, w also is a solution of the wave equation. It 
follows from (9.12) that the initial data of w are zero: 

w(x, t\ ) = 0, w t (x , t\ ) = 0. 

Therefore according to Theorem 9.2, w = u - v is zero in the trapezoid. This shows 
that u = v in the trapezoid; therefore u is of the form (9.7). □ 

We turn now to the proof of Theorem 9.2. 

Proof. Multiply the wave equation in Theorem 9.2 by u t and integrate over the 
trapezoid D\ we get 

I ( u t u tt - c 2 u t u xx ) dxdt = 0 . 

Jd 

We observe that the integrand is a divergence with respect to the (x, t) variables in 
that order: 


0 = u,(u tt - C l u xx ) = (: \uj) t -c z (u t u x ) x + c 2 u lx u x 
= (~c 2 u t u x ) x + \(u 2 t + c 2 u 2 x ), 

= div ( — c 2 iifU x , u 2 + c 2 u 2 )). 

Using the Divergence Theorem for F = (- c 2 u t u x , \(u 2 + c 2 u 2 )) and divF = 0 we get 


0 -f 


divF dxdt 


= I (-c 2 u t u x ,^(u 2 + c 2 u 2 ))-Nds 
JdD 

= - I ^(u 2 + c 2 u 2 )dx+ I ^(u 2 + c 2 u 2 )dx 

Jc 1 Jc 2 

+ I (~c 2 u t u x ,^(u?+ c 2 u 2 ))- Nds 

Jc 3 

+ I (~c 2 u t u x , \(u 2 + c 2 u 2 )) -Nds. 

Jc 4 

We have used N = (0, -1) on C\ and N = (0,1) on C 2 . Denote the integrals over C\ 
and C2 as E(t\) and Efa). Using N = l ^ (1 ,c) on C3 and N = y = (-1 ,c) 


on C 4 we get 


vr 


+ c z 


vr 


+ c z 
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l(-C 2 U t U x ) + ci(w? + C 2 ul) 

-F=^- ~ ds 

c 3 vr+^2 

, r -1 (-c 2 u t u x ) + 4(uf + c 2 u^) d __ Q 

Jc 4 VTT? 

The integrands in both integrals in (9.13) are perfect squares, \c(u t ± cu x ) 2 . There¬ 
fore both integrals are nonnegative. This proves that Efa) - E{t\) is nonpositive. 
Therefore 

E(t2)<E(t x ) (9.14) 

Denote by C(t) the portion of the string represented by a horizontal line segment 
drawn across the trapezoid at time t. The term I ^u 2 dx is the kinetic energy of 

Jc(t) 

the C(t) portion of the moving string, and the term I \c 2 u 2 dx is the elastic energy 

JC(t) 

in the C(t) portion of the stretched string. Their sum is the total energy stored in the 
C(t) portion of the string. So inequality (9.14) says that the energy stored in the C(t) 
portion of the string is a decreasing function of time. 

In particular if E(t\) is zero, it follows that so is Efo)- Since £fe) is the integral 
of u 2 + c 2 u 2 over C 2 , it follows that if E(t 2 ) is zero, u x and u t are zero in C 2 . 

Since the argument applies to every value of , it follows that u(x,t) is constant 
in the whole trapezoid. Since u is initially zero, that constant is zero. This completes 
the proof of Theorem 9.2. □ 


Example 9.3. We solve 

u tt = c 2 u xx , w(v,0) = 0, u t (x, 0) = cos(3v). 
According to Theorem 9.1 we can express 

u(x , t) = fix - ct) + g(x + ct). 

By equation (9.10) in the proof the data give 

g'(x) = cos(3v), f\x) = ~ cos(3v). 

2c 2c 

Integrating, we get 

g(v) = sin(3v) + ci, f{x) = - sin(3v) + C 2 . 
6c 6c 

Since 0 = u(x , 0) = fix) + g(v) = we get c\ - C 2 , and 

u(x , t) = -E sin (3(v - ct)) + ^ sin (3(v + ct)). 


□ 
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Example 9.4. We imagine a sound wave bouncing off a wall at v = 0 where 
the displacement u is zero. Take f(x) = -g(-x) in the wave equation solution 
u(x , t) = f(x - ct) + g(x + cO, so that 

u(x , 0 = -g(-x + cf) + g(v + ct). 

Then w(0,0 = -gict) + g(cO = 0 for all Suppose g(v) is zero on all but some 
positive interval, as in Figure 9.6. The resulting solution u can be viewed, for 
v > 0, as a wave g traveling left to a wall at v = 0, then reflected to the right, 
as an echo. □ 


g(x) 



X 



f(x) 


X 


u(x,0) 



0 



X 



The string with tied ends. We turn now to studying the motion of a stretched string 
that is tied at its two ends, located at v = 0 and at x = a(a> 0). See Figure 9.7 . The 
displacement of the ends is zero; we express this as the boundary conditions 

u(0,t) = 0, u(a,t ) = 0 for all t. (9.15) 


Suppose u is of the form 


u(x , t) = f(x - ct) + g(x + ct) 

and satisfies the boundary conditions. 

The boundary conditions state that 


f(-ct) + gict) = 0, f(a - ct) + gia + ct) = 0. 
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t 



0 


a 


Fig. 9.7 The band 0 <x<a, -oo <t < oo corresponds to locations on the string at all times. 


Denoting ct as x, the first relation says that f(-x) = -g(x). Setting this into the 
second relation we get 

-g(x -a) + g(x + a) = 0. 

Denoting x - a as y we rewrite this as 

g(y + 2a) = g(y). 

In words: g is a periodic function with period 2a. 

Since f(y) = -g(-y), it follows that also / is periodic with period 2a. Therefore 
u(x , t) = f{x - ct) + g(x + ct) is a periodic function of t with period . 

A function with period p also has periods 2p, 3p, and so forth. Thus a string that 
vibrates with period ^ also vibrates with period ^, n any whole number. 

Example 9.5. The functions 

sin(v)cos(c0, sin(2v)cos(2c0, sin(3x)cos(3 ct), 

and so on are vibrations of a string with ends tied at x = 0 and x-n. □ 

The frequency of vibration is the reciprocal of the period; so the lowest frequency 
is ^. We recall from our derivation of the wave equation that c 2 = So we get the 
following theorem. 

Theorem 9.3. A string of length a and weight W per unit length stretched 
with tension T can vibrate with the frequency ^ and each whole number 
multiple of this. 


This formula shows that the lowest frequency of vibration increases if 

(a) the tension in the string is increased, 

(b) the string is shortened, 

(c) the string is replaced by a thinner string. 
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All vibrations of a stretched string are multiples of the lowest frequency; it is this 
property that makes string instruments musical. We show in the next section that, in 
contrast, a vibrating elastic sheet does not have this property. 


Problems 

9.1. Suppose u is a solution of u tt - c 2 u xx = 0, and that at a particular time t , the 
graph of u as a function of x is convex ( u xx > 0). Is the acceleration u tt of the string 
up or down? 

9.2. A violin A-string has length 330 [mm] and vibrates at 440 cycles per second. 
Let A 4 = 27t( 440) and 

u(x,t) = sin(^)cos(A40. 

Find the value of so that u tt - ^ u xx = 0, 

9.3. A violin A-string produces a vibration 

u(x , t) = c\ sin (^) cos(A40 + C 2 sin () cos(E^t) 
where c\ and C 2 are some constants and u tt - c 2 u xx = 0. Find > 0 in terms of A 4 . 

9.4. Show that the function u(x , t)=A sin(bx)cos(bct) is a string vibration with ends 
tied at 0 and tt, that is, 


u tt - C 2 u xx = 0 , 

for certain values of the constants A, 
integer. 

9.5. At what speed and direction (left 

(a) cos(x + 30 

(b) 5 cos(v + 3 1) 

(c) -7 sin(£ - Ax) 

9.6. Show that the following functions are solutions of u tt - c 2 u xx = 0. 

(a) cos(x + ct ), 

(b) u(kx, kt) if k is a constant and u(x , t) is a solution, 

(c) au(x , t), if u is a solution and a a number. 

(d) u\ + U 2 , if u\ and U 2 are solutions. 

(e) sin(2v - 2 ct) + cos(3v + 3 ct) 

9.7. Verify that every function of the form 


u( 0, t) = 0, u(n , t) = 0, 

b. Show that A is arbitrary but b must be an 

or right) do these waves move? 


u(x , t) = f(x - ct) + g(x + ct) 
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where / and g are twice differentiable functions of a single variable, is a solution of 
Utt - c 2 u xx = 0 . 

9.8. Consider the string vibrations 

u\(x,t) = sin(x- 2 1) 

U 2 (x,t) = sin(x + 20 
us(x,t) = u\(x,t) + U2(x,t). 

(a) Sketch graphs of each one as a function of x, for times t- 0,1,2. 

(b) Which one moves to the left and which to the right? 

(c) Show that 1/3 (0,0 = 0. 

(d) Find values of a that give u?,(a, 0) = 0, and then show that for those a, u?,(a, t) = 0 
for all t. 

9.9. Assume 2c < n. Find the energy 

rb 

E(t) =\ I (u 2 -\-c 2 u 2 ) dx 
Ja 

of the vibration u(x, t) = cos(v + ct) for: 

(a) the interval [< a , b] = [ 0 , n] at time t- 0 , 

(b) the interval [a, b] = [ct, n - ct] at time t = 1 . 

Which is larger? 

9.10. Take the case of a string with tied ends, u( 0, t) = u(a, t) - 0. 

(a) Why is u t also zero at the ends? 

(b) Show that the energy in [0 ,a] is conserved, that is, E'(t) = 0, where 

E(J)-\ J* (u? + c 2 u 2 )(x,t)dx. 

9.11. Find a solution of each problem in the form 

u(x, t) = f(x - ct) + g(x + ct). 

(a) u tt - c 2 u xx - 0 with u(x, 0 ) = sinx and u t {x, 0 ) = 0 . 

(b) u tt -u xx - 0 with u(x, 0 ) = 0 and u t (x,0) = cos( 2 x). 

(c) u tt - 25 u xx = 0 with u(x, 0) = 3 sin v + sin(3x) and u t (x, 0) = cos(2v). 

9.12. Suppose the wave g(x + ct) in Example 9.4 is a brief sound made by an 
observer who waits for the echo -g(-x-\-ct) to return, and notes the time t\ elapsed. 
Show that the distance to the wall is \ct\, and can be determined using any brief 
sound g (as one knows of echos). 
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9.13. For x > 0 let u(x,t) be a solution of the wave equation as in Example 9.4. Let 
p be a positive number. Show that the energy remaining near the wall, 

0 u 2 + c 2 u 2 x ) dx, 

is zero for large values of the time. 



9.2 Vibration of a membrane 


Consider an elastic membrane stretched over a frame in the x,y plane. When the 
membrane is displaced in the direction perpendicular to the x,y plane and then 
released, it will vibrate in the perpendicular direction. We derive the differential 
equation governing this vibration and study its solutions. 




Fig. 9.8 Left: A small portion of the membrane. Right: We approximate the perpendicular compo¬ 
nent of the elastic force on the a + h edge as hT sin 6 ~ hT tan 6 = hTz x (a + h,y, t ). 


Denote by z(x,y,t ) the displacement at time t of point (x,y) of the membrane in 
the perpendicular direction. See Figure 9.8. We study the motion of a small portion 
of the membrane 


(x,y,z(x,y,/)), a<x<a + h, b<y<b + h, h small. (9.16) 

The motion of this portion is driven by the elastic forces acting on its four sides. 
The elastic force acting on an edge of this portion of the membrane lies in the tan¬ 
gent plane of the membrane and is perpendicular to the edge on which it acts; its 
magnitude is some constant T times the length of the edge. We shall study small 
vibrations, that is where the displacement z(x,y,0 and its derivatives z* and z_y are 
small. The vibrations are produced by forces in the direction perpendicular to the x, 
y plane. We analyze them similarly to the forces on a piece of string in Section 9.1. 
In this case the component of the force acting on the side x-a is well approximated 
by -hTz x (a,y, t) and the force on the side x = a + h is equally well approximated by 
hTz x (a + h,y, t). 
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The resultant of these forces is hT(z x {a + h,y,t) - z x (a,y,t)), and for h small is 
well approximated by h 2 Tz xx . Similarly the forces acting on the sides y = b + h and 
y = b, in the direction perpendicular to the x, y plane, are well approximated by 
h 2 Tz yy . The sum of these forces is h 2 T(z xx + z yy ). The mass of the portion of the 
membrane is h 2 p , where p [mass/area] is the density of the membrane. Therefore 
the equation governing the motion of this portion of the membrane, using Newton’s 
law that total force = mass times acceleration, is 

h 2 T(z xx + Zyy) = h 2 pztt, 


which we rewrite as 

Ztt = C 2 (Zxx+Zyy) (9.17) 

where c = This equation is called the two-dimensional wave equation . 

We investigate now some simple motions of a membrane spanning the square 

0<v<7r, 0<y<7r. (9.18) 

The membrane is fixed at the boundary of the square, therefore 

z(x,y) = 0 

on the boundary of the square. Define 

Zi (x,y, t ) = sin( yflct) sin(v) sin(y), 

Z2(x,y, t) = sin( V5 ct) sin(v) sin(2y). 

A simple calculation shows that both z\ and zi are solutions of the wave equa¬ 
tion (9.17), and they are zero on the boundary of the square (9.18). The solution z\ 
is periodic in time, with period -^=-, and the solution zi is periodic with period -^=-, 

and all integer multiples of these periods. Since no integer multiple of -^=- is equal 
to an integer multiple of . It follows that the sum 


Zl +Z2, 

that is a solution of the wave equation, is not periodic in time. It can be shown that 
only very special solutions of the two-dimensional wave equation are periodic in 
time. 

We express this result as follows: Vibrations of one-dimensional elastic systems 
are periodic in time, but the vibrations of two-dimensional systems are in general 
not periodic in time. This explains why all musical instruments are essentially one¬ 
dimensional vibrating systems. Violins and cellos use vibrating strings to generate 
sound, wind instruments like flutes and clarinets use vibrating thin columns of air to 
generate sound. One can point to drums as a truly two-dimensional instrument; but 
the sound of a drum of is muffled, without a definite pitch! 
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Problems 

9.14. Show that the following functions are solutions of the wave equation 

Ztt =C 2 (Zxx + Zyy). 

(a) X 2 -J 2 

(b) cos(ct) cos(x) 

(c) cos(ct) sin(y) 

(d) sin( V2ct) cos(x + y) 

9.15. Show that the following functions are solutions of the wave equation 

Ztt — C (Zxx Zyy)- 


(a) u + v, if u and v are solutions. 

(b) kz , if k is a constant and z is a solution. 

(c) z(-y,x,t ) if z(x,y,t ) is a solution, i.e., rotate n/2. 

(d) z(kx,ky,kt ) if z(x,y,t ) is a solution. 

9.16. Show that these functions are solutions of the wave equation 

Ztt — C (Zxx Zyy)- 


(a) cos(y + ct) 

(b) cos(x + ,y + y/lct) 

(c) cos(v - 2y - V5 ct) 

9.17. We have said that the function 

Z\ + Z2 = sin( V2 ct) sin(v) sin(y) + sin( V5 ct) sin(v) sin(2y) 
is not periodic in t. Show that the function 

sin(1.414c0 sin(x) sin (y) + sin(2.236c0 sin(x) sin(2};) 

2 n 

is periodic, repeating every 1000 — seconds. 

c 

9.18. Suppose a tension T [force/length] causes a force perpendicular to any short 
edge drawn in an elastic membrane, the force coplanar with the material. For exam¬ 
ple there are forces on three edges in the plane of the triangle in Figure 9.9. Denote 
by a and k the angle and hypotenuse length indicated. Show that the forces are as 
follows. 

(a) The force on the right edge is (Tksina,0). 

(b) The force on the bottom edge is (0, -Tk cos a). 

(c) The force on the hypotenuse is Tk(- sin a, cos a). 

(d) The net force on the triangle is the zero vector. 
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This is why we assume a constant T in the derivation of the wave equation (9.17). 



Fig. 9.9 Forces on a triangular portion of the membrane in Problem 9.18. 


9.19. Let n and m be positive integers, and 

z(x,y, t) = cos( y/n 2 + m 2 t) sin(ftx) sin (my). 

(a) Show that z is a solution of the membrane vibration equation Ztt = Zxx + Zyy that is 
zero on the boundary of the square 0 < x<n, 0<y <n. 

(b) Show that the number of such solutions that have frequency Vft 2 + m 2 < 1000 is 
roughly ^7r(1000) 2 . 

9.20. Take the case c = 1 so that the wave equation becomes ztt = Zxx + Zyy, and the 
solutions zi and Z2 in the text become 

z i = sin( V 2 1) sin(x) sin(y), zi - sin( V5 1) sin(v) sin( 2 y). 

(a) Suppose / is a twice differentiable function of one variable. Define 

z(x,y, t) = f(ax + by +1), 

which is called a traveling wave. What is required of the constants a and b so 
that z is a solution of the wave equation? 

(b) Verify that sin(w) sin(v) = -^(cos(w + v) - cos (u - v)). Use that and similar iden¬ 
tities to express Z 2 as a sum of four traveling waves. 

(c) The solution zi +Z 2 can be expressed as a sum of how many traveling waves? 

9.21. Let z(x,y, t) be a function of the form 

z(x,y,t) = f(r)sin(kt) 

where r = a/x 2 +y 2 and k is a constant, so that z depends only on the time and the 
distance to the origin. Show that z is a solution of the wave equation Ztt - z X x + Zyy if 
/ satisfies 

f'\r) + -f\r) + k 2 f{r ) = 0. 

r 
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9.22. The wave equation for a function u(X , t) in three space dimensions is 

u tt - c Au. 

(a) Find constants k so that the function u(X , t) = cos(2xi + 3 x 2 + 6 x 3 + kt) is a solu¬ 
tion of the wave equation. 

(b) Find the constant vectors A so that C 2 functions of the form 

u(X, t) = /(A -X±ct) 
are solutions of the wave equation. 



Fig. 9.10 A spherically symmetric solution of the wave equation in R 3 approaches the origin. See 
Problem 9.23. 


9.23. Let p = a/x 2 + y 2 +z 2 in R 3 . For waves u(p , t) that are spherically symmetric 
about the origin the wave equation u tt = Au becomes 

_ ,2 

Utt — Upp + p Up. 

? f(P — 0 

(a) Show that C functions of the form u(p , t) = - are solutions of the wave 

P 

equation for p > 0 . 

(b) Suppose a solution has the form 


u(p,t) 


fip + t) 
P 


where f(p ) is nonzero only in a small interval near p = 100 , and the maximum 
value of u(p, 0) is 1. See Figure 9.10. Approximately what is the maximum value 
of u(p , 99) and where does it occur? 
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9.3 The conduction of heat 

In this section we show that the distribution of temperature in a medium that con¬ 
ducts heat satisfies a partial differential equation. It is a conservation law as we have 
discussed in Chapter 8. 

We start with the one-dimensional case, the conduction of heat in a rod. Let the 
rod be on the interval [0, a] along the v axis. We denote by T(x, t) the temperature at 
the position v and at time t. Assume T is a C 2 function. We assume that heat energy 
density is proportional to temperature, so that the heat energy in the section [ b , c] of 
the rod at time t is 



(9.19) 


where p is some positive constant. 

Next we assume that heat energy is conducted from a hotter to a colder region, at 
a rate proportional to the gradient of temperature. Here “gradient” refers to the rate 
of change of temperature with respect to position, so in the one-dimensional case, 
to T x . Therefore heat enters the section [b,c] at its endpoints, at the rates 



(9.20) 


where r is some positive constant. The energy conservation law states that the rate 
at which heat flows across the boundary of [ b , c ] is the time derivative of the total 
heat energy in [b,c]. 



(9.21) 


On the left side we carry out the differentiation with respect to t under the integral 
sign; we get 



The right side is a difference of the values of the function 


between x - c and x-b. We express it as the integral of its derivative; we get 



We see that this is in the conservation law form of equation (8.25), 
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if we take / = -rT x and p = pT. We rewrite it as 

X 


p ~g^^ x ’ t ^~ r ~^2 ( ' x ’ t ^ dx = 0 - 


Since this integral is zero over all intervals [ b , c], it follows that the integrand is zero, 

dT d 2 T 

p T,- r w =a 


We rewrite this equation as 

T t -hT xx = 0, (9.22) 

r 

where /z = — is a positive constant. Equation (9.22) is called the equation of heat 
conduction. 

Example 9.6. Verify that the function 

T(x,t ) = e~ ht sin x 

is a solution of the heat equation (9.22). We have 

T t (x,t) = -ht~ ht sinx = - hT(x,t) 


and since T xx (x,t) = -T(x,t), this is a solution. It decays toward zero as time 
increases, due to the exponential. See Figure 9.11. 

Since T x = t~ ht cos v the flow of heat energy is to the left at v = 0, is zero at 
x = |, and is to the right at v = n. □ 



Fig. 9.11 Graphs of the heat solution T(x,t ) = e ht sinx as a function of x at times t\ <ti. See 
Examples 9.6 and 9.7. 


Example 9. 7. Suppose T is a solution of the heat equation, such that at some 
time t the graph of T as a function of v is convex, T xx > 0. It follows from the 
heat equation that T t > 0. Therefore T(x,t) is an increasing function of t. See 
the left half of Figure 9.11 where T is convex. □ 

We derive now some of the basic properties of solutions of the equation of heat 
conduction in a rod whose endpoints are kept at some constant temperature. 

Since the heat equation only involves the derivatives of T, it follows that if T(x, t ) 
is a solution of the heat equation, so is T (v, t) - k , where k is a constant. Choose k to 
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be the constant temperature at the endpoints; then T - k is zero at the endpoints. So 
it suffices to study solutions of the heat equation that are zero at the endpoints of the 
rod. 

We show now the following property of such solutions: 

Theorem 9.4. Let T be a solution of the heat equation T t - hT xx = 0 for x in 
[0, a] that is zero at the endpoints. Then the maximum ofT(x , t) over x in [0, a\ 
is a decreasing function oft. 


T(0,t) = 0 



T(a,t) = 0 


Fig. 9.12 Temperature T(x, t) in a rod 0 < x < a. 


Proof. We show first that for t>0, T(x, t ) is never larger than its maximum at t = 0. 
We argue indirectly; suppose for some time s > 0 and some y in (0 ,a), T(y,s ) is 
larger than the maximum of r(x,0). Then there are positive numbers M and e such 
that for all v in [0 ,a\, 


T(x,0) < M < M + ea 2 < T (y, s). 

Let R be the rectangular set [0, a] x [0, s] and consider the function 

u(x , t) = T (x, t) + e(x - y ) 2 

on R. Since T(x, t) is zero at the endpoints and T (v, 0) < M, 

u(0 , t) = T (0, t) + ey 2 < ea 2 
u(a , t) = T (< a , t) + e(a - y) 2 < ea 2 

u(x , 0) = T(x, 0) + e(x -y) 2 <M + ea 2 . (9.23) 

Since 

u(y , s) = T(y, s) > M + ea 2 

the maximum of u on R is at least T(y, s ), and according to (9.23) it must occur at 
some point (v, t ) with t > 0 and v not at either endpoint of the rod. Therefore at this 
maximum we have u xx < 0 and u t > 0. Then 
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0 > u xx h^t — T xx + 26 j^Tt — 26. 

This is a contradiction since e > 0. 

To complete the proof suppose 0 < t\ < and set v(v, t)-T (x, t\+t). Then v is a 
solution of the heat equation. Let v(c,0) be the maximum of v(v,0) on [0 ,a\. Then 


max T(x, t^)- maxv(x, t 2 ~h) 

X X 

<v(c,0) = T(c,h) 

< ma xT(x,t\). 

X 


□ 


Since -T(x,t) is a solution as well of the heat equation that is zero at x = 0 
and x = a, it follows that the minimum value of T(x, t) over all v is an increasing 
function of t. Combining the two results we deduce from Theorem 9.4 the following 
corollary. 


Corollary 9.1. Let T be a solution of the heat equation T t - hT xx = 0 for x 
in [0, a] that is zero at the endpoints. Then the maximum with respect to x of 
\T(x,t)\ is a decreasing function oft. 


We can now prove the uniqueness theorem : 

Theorem 9.5. Suppose T\ and T 2 are two solutions of the heat equation in 
[0, a\, t > 0, that are equal at t = 0, equal at the endpoint x - 0 and equal at 
the endpoint x- a. Then Tfx,t) = T 2 (x,t)for all t> 0 and x in [0 ,a]. 


Proof. Set T = T\-T 2 . Then 


T t -hT xx = 0 

r(*,o) = o 

7X0,0 = 0 
T (a, 0 = 0. 

According to Corollary 9.1, \T(x,t)\ decreases from its initial value. But that is zero. 
So T is identically zero, and T\ - TX- □ 

Examples of the heat equation in higher dimensions The conduction of heat in a 
plate can be analyzed similarly to the way we analyzed the conduction of heat in a 
rod; we look at the flow of heat into a small portion of the plate b <x<c, d <y <e 
across its boundary. We obtain an equation analogous to T t — hT xx = 0, 


T t -hAT = 0 , 


( 9 . 24 ) 
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where AT - T xx + T yy . In Problem 9.31 we outline a derivation of this equation using 
the Divergence Theorem. 

Example 9.8. Define T ( x,y , t) - eT at sin (bx + cy). We find the relation of a , b , c 
so that T is a solution of the equation T t -hAT = 0. 

T t -hAT - -aoT at sin (bx + cy) - hoT at (-b 2 - c 2 ) sin (bx + cy) 

= (- a + (b 2 + c 2 )h)T. 

That is zero if a = (b 2 + c 2 )/l So 

Q -{b 2 +c 2 )ht + c -y) 

is a solution of the heat equation for all numbers b, c. □ 

Example 9.9. You have seen in Problem 4.7 that the function 

T(X,t) = (4;rO“ w/2 e" l|X||2/(40 

is a solution of the heat equation in n space dimensions, 

T t -AT = 0. 

□ 


Problems 

9.24. Show that these functions are solutions of the heat equation T t = hT xx . 

(a) mx + k for all constants m, k 

(b) T\ + T2 if T\ and T2 are solutions. 

(c) Q~ ht COS(x) 

(d) some functions e“^sin(mv); find the relation between k and m, 

(e) u(x,ht) where u(x,t) is any solution of u t = u xx . 

9.25. Show that these functions are solutions of the heat equation T t = T xx . 

(a) e - " 2 *sin (nx) for n- 1,2,3,..., 

(b) t p Q~ x2 ^ 4t ^ for a certain exponent p\ find p , 

(c) Q~ ax cos (ax - bt) for some constants a , b\ find the relation between a and b. 

(d) u(kx,k 2 t) for any constant k if u is a solution. 


9.26. The temperature at a moderate distance v below the ground is modeled as 

T (x, t) = oT ax cos {ax - bht) 
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Fig. 9.13 Earth in Problem 9.26. 



where the value of the constant b depends on whether we are discussing daily (sun¬ 
rise, sunset) or seasonal (winter, summer) variations. See Figure 9.13. 

(a) Show that T t - hT xx if b- la 2 . 

(b) Is b larger for daily or seasonal variations? 

(c) Considering the factor e -ax , do daily or seasonal variations penetrate deeper into 
Earth? 

9.27. A rod is held at zero temperature at the ends, so that 

T t = T xx , r(f,0) = r(f,7T) = 0. 

(a) Fill in the missing numbers so that 

T (v, t) = (?)e (? ^ sin v + (?)e (?)r sin(2v) 

is a solution with initial value T(v,0) = sinv+ \ sin(2x). 

(b) Sketch the graph of T as function of v at times t = 0, \, 1. 

(c) Which of the two terms decreases faster as t increases? Note the maximum tem¬ 
perature is located left of the center point of the rod initially. Does the hot spot 
move to the left or right as time increases? 

9.28. We can calculate approximate solutions of the heat equation T t = T xx by 
approximating the derivatives by difference quotients, where At and Ax are posi¬ 
tive numbers, 

T(x,t + At ) -T(x,t) T(x + Ax,t) - 2 T(x,t) + T(x-Ax,t) 

At (Ax) 2 

(a) Solve that approximation for T(x,t + At) in terms of the earlier values at time t. 

(b) Take = \. Show that the approximation becomes 


T(x,t + At) = \(T(x-Ax,t) + T(x + Ax,t)). 
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(c) Initial values T (x, 0) are given in the table. Fill in values of T for five time steps 
using the approximation in (b). One of the values, T(-Ax,At) = ^(16 + 0) = 8 , is 
indicated as an example. 


5 At 
AAt 

3 At 
2 At 

At 

0 

0 

0 

0 

0 

0 

8 

0 

16 

16 

0 

0 

0 

0 

0 

0 


- 6 Ax 

-5 Ax 

-4zfx 

-3 Ax 

- 2 zfx 

-Ax 

0 

Ax 

2 zlx 

3Ax 

4Ax 

5 Ax 

6Ax 

7 Ax 


9.29. A simple model for temperature y(t) of an object at time t in an environment 
of temperature s is Newton’s Law of Cooling, 


y' = -k(y-s ) 

where k is a positive constant. Consider two heat equation solutions 

T i (x, t) = e _? sin(x), 7X(x, t) = e _r sin(x) + e~ 9t sin(3x) 


for a rod located in the interval [ 0 , 7 r]. We imagine the environment s = 0, and define 
the average temperature of the rods to be 

yi(0=- f Ti(x,t)dx, y 2 (t)=- f T 2 (x, t)dx. 
n Jo n Jo 

Show that Newton’s Law of Cooling holds for y\ but not for y 2 . 


9.30. A steady state solution of T t — hT xx is one that does not vary with time, thus 
T xx = 0 . 

(a) Find a steady state temperature function T (x, t) for 0 < x < a with L(0, t) = 50 and 

7X10,0 = 100. 

(b) Verify that for your steady state function, the total rate that heat enters (see equa¬ 
tion (9.20)) at left and right ends is zero, 

dT d T 

~r —( 0,0 + r ——(< 2,0 = 0 . 
ox ox 


(c) Verify that the total heat energy (see equation (9.19)) for your steady state func¬ 
tion, 

pT (x, 0 dx 

is independent of the time. 



9.31. In this problem you can derive the heat equation in 2 space variables 


T t -hAT = 0. 
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The analogue of the heat conservation law (9.21) states that for all regular sets D in 
the plane, 


^ pT(x,y,t)dA = r C } 

dt Jo JdD 


VT-Nds. 


Justify the following items. 

(a) The rate of change of total energy is I pT t dA. 

Jd 


(b) 

(c) 


f pT t dA= f div(rVT)dA. 
Jd JdD 

f (pT t -rJT) ds = 0. 

Jd 


(d) T t -hAT = 0, where h = £. 

9.32. Suppose a solution of the two-dimensional heat equation has the form 


T(x,y,t) = e~ cht f(r), 


where r is the polar coordinate r = \lx 2 + x 2 . Thus T only depends on time and on 
the distance to the origin. Substitute into T t = hAT and use the Chain Rule to show 
that / satisfies 

f"(r)+-f(r) + cf(r) = 0. 
r 

9.33. Show that T(x,y,t ) = x 2 +y 2 +4 ht is a solution of the two-dimensional heat 
equation T t = hAT , and describe the direction of heat conduction. 


9.4 Equilibrium 


In Section 9.2 we derived the differential equation governing a vibrating membrane, 

Ztt — C (Zxx + Zyy)’ 

and in Section 9.3 we derived the differential equation governing the flow of heat in 
a plate, 

T t - h(T xx + Tyy). 

We consider now the case of equilibrium, that is membranes in which the elastic 
forces are so balanced that they do not vibrate, and heat-conducting bodies in which 
the temperature is so balanced that it does not change. 

The equations of equilibrium can be obtained from the equations of time change 
by simply setting the time derivatives in these equations equal to zero. So we obtain 
from the equation of of a vibrating membrane the equilibrium equation 


£xx + %>’ - 0 , 
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and from the equation of heat conduction the equilibrium equation 


T X x + T yy - 0. 


We observe, with some astonishment, that except for the symbols used, these equa¬ 
tions are identical. The equation 


An = 0 


is called the Laplace equation and the solutions are known as harmonic functions. 
There is no physical reason why the equilibrium of an elastic membrane and the 
equilibrium of heat distribution should be governed by the same equation, but they 
are, and so 

Their mathematical theory is the same. 

This is what makes mathematics a universal tool in dealing with problems of 
science. 

We state and prove an important property of solutions of the Laplace equation. 

Theorem 9.6. Let u and v he two solutions of the Laplace equation on a con¬ 
nected regular set D in R 2 that are equal on the boundary of D. Then u and v 
are equal in D. 

Another way of expressing this theorem is: Solutions of the Laplace equation in 
a regular set D in R 2 are uniquely determined by their values on the boundary of D. 
We give now a mathematical proof of this proposition. 

Proof. Denote by z the difference of u and v, 


z(x 9 y) = u(x 9 y)-v(x 9 y). 


Since u and v are solutions of the Laplace equation, their difference z is a solution. 
Since u and v are equal on the boundary of D , z is zero on the boundary. Multiply 
the Laplace equation Az = 0 by z and integrate the product over D. We get 



(9.25) 


Using the product rule for div we get 


div (zVz) = zAz +1| Vz|| 2 . 


(9.26) 


Therefore 



Since z - 0 on dD we get from the Divergence Theorem that 
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Therefore 


■I 


(zl + z 2 y )&x&y = 0. 


(9.27) 


One says that (9.27) has been obtained from (9.25) using “integration by parts” and 
the product rule for div in (9.26). 

The integrand in (9.27) is a sum of squares and therefore nonnegative. Since the 
integral is zero, so is the integrand; therefore 


Zx — 0, Zy — 0 in D. 

A function z whose partial derivatives are zero in D is a constant in D\ since z is 
zero on the boundary, that constant is zero. Since z-u — v , this proves that u and v 
are equal in D , as claimed. □ 

We observe that the uniqueness result in Theorem 9.6 is intuitively clear if we 
interpret z as the displacement of an elastic membrane. For if the boundary of the 
membrane is constrained to lie in the plane z = 0 , then the whole membrane will lie 
in that plane. 

We note that Theorem 9.6 and its proof can be extended to functions of three, or 
any number, of variables. 

A further basic result about the Laplace equation in D is that given any contin¬ 
uous function on the boundary of D , there is a solution of the Laplace equation in 
D with these prescribed boundary values. The result is plausible, for if we stretch a 
membrane over a frame on the boundary of D, the membrane will take on a shape in 
equilibrium. The proof of this proposition is beyond the scope of a calculus book. 


Problems 

9.34. Show that these functions are solutions of the Laplace equation u xx + u yy = 0. 

(a) x 2 -y 2 

(b) x 3 - 3xy 2 

(c) e _GX sin (by) for some constants a , b\ find the relation between a and b. 

9.35. Show that these functions are solutions of the Laplace equation u xx + u yy = 0. 

(a) u\ + U 2 if u\ and U 2 are solutions 

(b) u(xcos6-ysin6,xsin6+ ycos6) if u is a solution and 6 is a constant angle, i.e., 
rotate the solution. 

(c) the product uv of two solutions, if the gradients Vu and Vv are orthogonal. 

9.36. Figure 9.14 shows a region bounded by two level sets of T and two curves 
tangent to VT, for a function T with Txx + T yy - 0. Justify the following items. 

(a) The curves are orthogonal at the corners. 

(b) The flux of VT toward the right is the same across the two level sets. 
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Fig. 9.14 The region in Problem 9.36. 


(c) The flux of VT is zero across each of the other two curves. 


9.37. Show that z(x,y) = log(x 2 + y 2 ) is a solution of the Laplace equation Az = 0. 

9.38. Recall that the Laplacian of /, Af in any number of dimensions is divV/. 
Show that zl(||X|| _1 ) is zero in three dimensions, but not in 2 or 4 dimensions. 

9.39. Suppose u , v and w are C 2 functions in a regular set D in R 3 . Derive item (a) 
from the Divergence Theorem for the vector field wVw, and justify the following 
items. 

(a) If w = 0 on the boundary of D , then I (Aw)wdV = - I Vw • VwdV. 

Jd Jd 

(b) If u solves Laplace’s equation 

Au = 0 in D 
u- 0 on dD 

then u(x 9 y,z ) = 0 at all points of D. 

(c) Suppose u and v are two solutions of the Laplace equation 

Au = 0, Av = 0 

in a regular set D that are equal on the boundary of D. Show that u and v are 
equal in D. [Hint: apply item (b) to u - v.] 

9.40. We derived the wave equation (9.17) from Newton’s F = ma law of motion, 
assuming that the only forces on a portion of the membrane were due to the tension. 
Suppose instead that the membrane is in equilibrium subject to a uniform pressure 
p [force/area] on the top surface. See Figure 9.15. Then the sum of edge forces on 
the portion a < x < a + h, b <y < b + h are balanced by the force due to p. Justify 
the following statements. 

(a) The upward force is - ph 2 when the slopes z x , z y are small. 

(b) z satisfies the differential equation + Zyy = f where T is the tension. 
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Fig. 9.15 Pressure above a membrane in Problem 9.40. 

9.41. Show that the following functions are solutions of a pressurized membrane 
equation z xx + % = 1 as in Problem 9.40 


(a) \{x 2 +y 2 ) 

(b) |^ + iy 2 


(c) kz + (1 - k)w , if z,w are solutions and k constant, 

(d) z + w, if z is a solution and w satisfies the Laplace equation w xx + w yy - 0. 

9.42. Here Au = u xx + u yy . Let h> 0, and denote the values of u at four points of the 
compass as 


ue =u(x + h,y) 
us =u(x,y- h) 
uw =u(x — h,y) 
un =u(x,y + h). 


See Figure 9.16. 

(a) Use Taylor’s theorem to show that 


1 rs 

Au(x,y ) = — (ue + us + uw + ujy -4u(x,y)) + 0(h). 
h 1 

(b) We use part (a) to approximate a solution of the equation 


An = 1 


in the square 0 < x < 1, 0 <y<l shown on the right side of the figure, with 
boundary values indicated. Use the approximation 



to set up a system of linear equations for values u\,U 2 ,u^,U 4 at the indicated 
points. 

9.43. Let z(x,y) = sin(nx)sinh(ny), where n is a positive integer. 

(a) Verify that Az = 0. 

(b) Verify that z = 0 on the boundary of the region 0<x<7r, y>0. 

(c) Show that z(x,y) is unbounded. 
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20 30 

40 

u = 50 

X 30 40 

Fig. 9.16 Notation for Problem 9.42. Left: part (a) Right: part (b). 


W 


u = 10 

20 


• 

• 

u 2 

U 4 

• 

• 

U 1 

U 3 


Remark: This does not happen with the heat and wave equations, and shows that 
we need to specify some boundary values all the way around the boundary for the 
Laplace equation. Or, as we may say, an initial condition cannot be specified. 

9.44. We consider the vector field Vu where u is a solution of the Laplace equation. 

(a) If Au = 0, show that Wu has divergence equal to zero. 

(b) Show that the functions x 2 -y 2 and 2xy are solutions of the Laplace equation. 

(c) Show that the vector fields F = V(v 2 -y 2 ) and G = V(2xy) have divergence zero 
and are orthogonal to each other at each point (v,y). 

(d) Show that at each point (x,y), the vectors F(x,y) and G(v,y) have the same length. 

(e) Sketch the vector fields F and G. These are simple models for the velocity of an 
incompressible fluid flow. 



Fig. 9.17 The vector field in Problem 9.45 models incompressible fluid flow around a cylinder. 

V 

9.45. Let u(x,y) = x+ —--. 

x z +y z 

(a) Show that u is a solution of the Laplace equation; therefore the vector field F = Vu 
has divergence equal to zero. See Figure 9.17. 

(b) Show that F is tangent to the unit circle, that is, for all points on x 2 + y 2 = 1, 


(x,y)-F(x,y) = 0. 
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(c) Show that as x 2 +y 2 tends to infinity, F(v,y) tends to (1,0). 
9.46. Suppose u(x,y,z) is a solution of the Laplace equation in R 3 

UXX Uyy + U ZZ = 0 . 

Define a vector field F = Vu. Show that divF = 0 and curlF = 0. 


9.5 The Schrodinger equation 


The Schrodinger equation is the basic equation of quantum mechanics. It is for 
complex valued functions if/ = / + ig where / and g are real valued functions of 
X = (x,y,z) and of t , and i 2 = -1. The function ^ = f -ig is called the complex 
conjugate of if/. Complex valued functions can be differentiated by treating i as a 
constant. 

The Schrodinger equation is of the form 

ii// t = —Aiff + Vifr (9.28) 


where V is a real valued function with the property that V(X) tends to zero rapidly 
as ||X|| tends to infinity. We consider solutions that tend to zero rapidly as ||X|| 
tends to infinity. 

The physics interpretation of solutions of Schrodinger’s equation is based on the 
following property of its solutions. 


Theorem 9.7. Let i// be a solution of Schrodinger’s equation whose partial 
derivatives tend to zero rapidly as ||X|| tends to infinity. Then the integral 



|^(X,/)| 2 d 3 X 


(9.29) 


is independent oft. 


Proof. We write 


L 


l*A(X, f)| 2 d 3 X = f 

JR3 


i/s(X,t)i/f(X,t)d 3 X and differentiate with 


respect to t. Since j/ and its t derivative tend to zero rapidly as ||X|| tends to infinity, 
we have 


d_ 
d t 


f W(X,t) | 2 d 3 X= f (^(X,f)iA(X,o)d 3 x 

Jr 3 J R 3 dC > 

= f (^(X,o^(X,o + ^x,owx,o)d 3 x. 

JR3 V 7 


We use the Schrodinger equation to express if/t and if/ t . Since V is real, the integral 
on the right becomes 
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fj - i( -^ +vw + «-^ + w)d 3 x -L (WW-M) d 3 X 


Since if/ and its first partial derivatives with respect to x, y and z tend to zero rapidly 
as X tends to infinity, we show in Problem 9.50 that the integral on the right side is 
zero. It follows that 

£ f l<KX,r)| 2 d 3 x = o. 

at J R 3 

This proves Theorem 9.7. □ 


We give now the physical interpretation of Theorem 9.7. Suppose that at t = 0 the 
function if/ satisfies 


It follows that for all t 


Let 


r 

Jr 3 

L 


|iA(X,0)| 2 d 3 X = 1. 


|^(X,r)| 2 d 3 X= 1. 


\ifr(X,t)\ 2 = p(X,t). 


For each t, p(X, t) is a nonnegative function on K 3 whose integral over R 3 is 1. Such 
a function is a probability density function. Suppose p is integrable with respect to 
X on a set S in R 3 . Then 

P(S,t) = J p(X,t) d 3 X, 

is the probability associated with the set S at time t. What is the physical interpreta¬ 
tion of this probability? According to quantum mechanics, P(S,t) is the probability 
that a particle governed by the Schrodinger equation is located at time t in the set S . 

This formulation is a radical philosophical departure from the Newtonian picture; 
instead of having a definite position in space, at each instant of time there is only a 
probability of a particle’s location. Many physicists had to struggle to accept such a 
probabilistic description, Einstein among them. He famously remarked “God does 
not play dice with the Universe.” But the great success of quantum mechanics has 
led to the universal acceptance by physicists of the probabilistic interpretation of 
solutions of the Schrodinger equation. 


Problems 

9.47. Let 0(X) be a solution of the equation 

Ecp = —A(f> + V(fi 


where E is a real number. 


418 


9 Partial differential equations 


(a) Show that the function 


<KX,r) = e~ iEt cf>(X) 


is a solution of Schrodinger’s equation (9.28). 


r 

Jr 3 


(b) Suppose that I |0(X)| 2 d 3 X = 1. Show that the probability that the particle is 


located in a smoothly bounded set S is 


P(S,t) = 


X 


WX)| 2 d 3 x, 


independent of the time. 


9.48. Define the function 0(X) = n ^ 2 ze where X = (x,y,z). 
(a) Show that 




(b) Define = e ir 0(X) as in Problem 9.47, so that ^ is a solution of the 

-4 

Schrodinger equation with V(X) = ——. We have shown in Example 6.41 using 

\\X\\ 

spherical coordinates that 

WX)| 2 


is a probability density function. Set up an iterated integral for the probability 
that the particle described by ij/ is in the set S given by ||X|| < 3. 

Figure 9.18 illustrates a level set of the probability density in (b). 



Fig. 9.18 Sketch of a level set in Problem 9.48. The plane z = 0 is indicated for reference. 

9.49. Take V(x,y,z) = x 2 +y 2 +z 2 in the Schrodinger equation, and consider the 
corresponding equation 


Ecj) = -A(p + (x 2 + y 2 + z 2 )0. 


(9.30) 









9.5 The Schrodinger equation 
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(a) Suppose X , Y and Z are functions of x, y and z respectively, and 


E x X = -X" +x 2 X 
E y Y = -Y" +y 2 Y 
E z Z = -Z" + - 2 Z 


for some numbers E x , E y and E z . Show that the function (f>(x 9 y,z ) = X{x)Y(y)Z{z) 
satisfies (9.30) for some number E expressed in terms of E x , E y and E z . 

(b) For each of the numbers E w = 1,3 and 5, find a function of the form 


W(w) = (a+ bw + cw 2 )e w ^ 


that satisfies E w W = -W" + w 2 W. 

(c) Using functions from (b), create functions 0 that satisfy (9.30) with energy levels 

E = 3,5,7,9,11,13 and 15. 

(d) Show that the probability density |e _1 ^0(X)| 2 associated with the E = 3 case is 



9 . 50 . The integral of a complex valued function / = a + ib, where a and b are real 
valued functions, is the integral of a plus i times the integral of b. 

(a) Let F = (/i,/ 2 ,/ 3 ) have complex components fj that are C 1 functions on R 3 , 
that is, the real and imaginary parts of each fj are C 1 . Then divF and curlF are 
complex. Show that the Divergence Theorem 8.3 and Stokes’ Theorem 8.4 hold 
for complex valued functions. 

(b) Let / and g be complex valued functions on R 3 that are C 2 functions, that is, 
the real and imaginary parts of / and g are C 2 . Then V/ and Vg have complex 
components and A f and Ag are complex. Prove the formula that we used in the 
proof of Theorem 9.7: if /, g, V/ and Vg tend rapidly to zero as X tends to 
infinity then 
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Problems of Chapter 1 

Section 1.1 
1.1 3.5 
1.3 

(a) < 2 ( 1 ,- 1 ) + Z?(1,1) = (<2 + b, -a + b) = (0,0). 

Add the equations a + b = 0 and -a + b = 0 to get 2b = 0, so a = b = 0. Therefore 
the vectors are linearly independent. 

(b) + 1) = (a + b,-a + b) = (2,4). 

Add the equations a + b = 2 and -a + b = 4 to get 2Z? = 6, so b = 3. Then the first 
equation gives <2 + 3 = 2 so a = -1. 

(c) <2(1,-1) + Z?(1,1) = (<2 + /?, -<2 + Z?) = (v,y). Now 2/? = x + y so b = ^(v + y). Then 
the first equation gives <2 + ^(v + y) = v, so <2 = ^(x-y). 

1.5 ^(v,y) = ax + Z?y. We need <2 + 2b = 3 and 2<2 + 3Z? = 5. Subtract twice the first 
from the second to eliminate < 2 , giving —b - - 1. Then the first equation gives <2=1. 
So €(x,y) = x + y. 

1.7 

(a) The line through 0 and U consists of all points cU; that through V and U + V is 
all V + dU. They don’t intersect because cU = V + d\J gives a nontrivial linear 

combination. Therefore they are parallel. Alternatively the line through 0 and U 

U 2 

has slope — (or is vertical if u\ =0), and the line through V and U + V has slope 

U\ 

(U 2 + V 2 ) —V 2 ^ * s ver tj ca i) These are equal, so those two sides are parallel. 
(ui +vi)-vi 

(b) Similar to (a). 

1.9 W = -U-V, so U +V +W = 0. 
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1.11 ATJ + V) = ( u\ + vi)- 8 ( 4/2 + V 2 ), while €(U ) + ^(V) = ( u\ - 8 // 2 ) + (vi - 8 V 2 ), and 
these are equal. €(c\J) = cu\ - 80 / 2 , while c£( U) = c(u\ - 8 // 2 X and these are equal. 
So £ is linear. 

1.13a + 3Z? = 4, 3a + b = 5 

1.15 (4,8) = 4(1,2) = 2(2,4) = -3(1,2) + |(2,4) and many other combinations 
U and V are linearly dependent. 

1.17 

(a) Rotation carries U to V, V to W, and W to U, so it carries U + V + W to itself. 
No vector except 0 is rotated to itself, therefore U + V + W = 0. 

(b) The sines are the y coordinates of U, V, and W where 6 is the angle from the 
x axis to one of the vectors. 

(c) The sum of n equally spaced vectors on the unit circle is the zero vector by the 
same rotation argument; the cosines are the v coordinates and therefore their 
sum is zero. 

1.19 /(.5,0) = -/(.5,0) = -100 

Section 1.2 

1.21 

(a) U • (V + W) = u\(v\ + w 1 ) + U 2 (v 2 + W 2 ) is equal to 
U • V + U • W = (u\ vi + U 2 V 2 ) + (u\W\ + U 2 W 2 ). 

(b) U- V = u\v\ +U 2 V 2 = v\u\ +V 2 U 2 = V-U 

1.23 All but (d) because all have norm one except for the last one. 

1.25 If £(x,y) = ax + by then we are given 2a + b = 3, a + b = 2. Subtract to find 
a=l,thenfe=l,soC = (l,l), £(U) = C-U. 

1.27 Replace V by -V in (1.8), or expand (u\ + vi ) 2 + ( U 2 + V 2) 2 = u\ + 2u\V\ + v 2 + 
u\ + 2u 2 v 2 + V 2 . Collecting terms gives ||U + V || 2 = ||U || 2 + 2U • V + ||V|| 2 . 

1.29 

(a) U • C = aC • C + M) • C = aC • C, soa= 

lien 2 

DU 

(b) By similar argument, or interchange the symbols, b = . 

(c) (|, |) and (- 5 , 5 ) are orthogonal unit vectors. Therefore a = 24 ^ 36 = 12. 

1.31 U = (-^.^), V = ( 2 V 2 , 0 ). 


Section 1.3 

1.33 b(U + W, V) = (mi +wi)vi = mi vi +wivi = £(U, V) + /?(W, V), and 
b(a\J, V) = 0au\)v\ = a{u\V\) = /?(U,V). 
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1.35 Because of the term qr , the variables q and r are not both in the same vector. 
Similarly r and p are not in the same vector. 

Set U = (g, p) = {u \, U 2 ) and V = (r, s) = (vi, V 2 ). Then 

f(p,q,r,s) = qr + 3rp- sp = u\V\ + 3 z/ 2 Vi -U 2 V 2 = Z?(U,V). 

Section 1.4 
1.37 

(a) (vi + w\ ,...) = (w\ + vi,...) because addition of numbers is commutative 

(b) ((vi + u\) + wi,...) = (vi + (u\ + wi),...) because addition of numbers is 
associative 

(c) c{u\ + Vl,. . .) = (i c(u\ + Vi),...) = ( CU\ + CVi ,...) = ( CU\ ,...) + (CV 1 ,...) = cU + cV 

(d) (c + d)(u \,...) = ((c + d)u \,...) = ( cu\ + du \,...) = (cu \,...) + ( dv \,...) = cU + d\J. 

1.39 X = ciUi + C 2 U 2 + C 3 U 3 = (x\ 9 X 2 9 xs) = (c 1 + C 2 + C 3 ,C 2 + C 3 ,C 3 ) gives first 
C 3 = X 3 , then C 2 -X 2 -X 3 , c\ - x\ -X 2 . 

1.41 If 

ci(l, 1,1,1) + c 2 (0,1,1,1) + c 3 (0,0,1,1) + c 4 (0,0,0,1) 

= (Ci,Ci +C 2 ,Ci +C2 + C 3 , Cl +C 2 +C 3 +C 4 ) = (0,0,0,0) 

then the first component gives ci = 0, the second then gives C 2 = 0, etc. Therefore 
the vectors are linearly independent. 

1.43 If < 2 ( 3 , 7 , 6 , 9 , 4 ) + Z?(2,7,0,l,-5) = (- ^,-|,3,|,7) then the third component, 
6 a = 3, shows that a must be Then the last component, 2 - 5b = 1 gives b = - 1. 
Checking the other three components shows that indeed ^U-V = (- ^,-^,3,^,7). 

1.45 

(a) ^(cU) = c\(cu\) H- \-c n (cu n ) = cc\u\-\ - v cc n u n = c^(U), and 

(b) ^(U + V) = ciiui +Vi ) + --- + c n (u n + v n ) = (c\u\ +c\vi) + -- + (c n u n + c n v n ) 

= (dMi + ■ ■ ■ + c n u n ) + (civi + • • • + c^vj = £(U) + ^(V). 

1.47 (1,2,3)+ (3,2,1) = (4,4,4) and this is 4(1,1,1). So -4(1,1,1) + (1,2,3) + 
(3,2,1) = 0 is a nontrivial relation among the vectors, and they are dependent. 

1.49 (a), (b), and (d) are bilinear. Only (d) is symmetric, only (b) antisymmetric. 

1.51 (a) and (d) only. 

Section 1.5 
1.53 

(a) Since U • (V + W) = u \(vi + w \) h- \-u n (v n + w n ) 

= u\v\ + • • • + u n v n + u\w\ + • • • + u n w n = U • V + U • W the dot product is distribu¬ 
tive and 
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U • (cV) = u\{cv\) H-h u n (cv n ) = c{u\V\ H- 1 - u n v n ) = cU • V, similarly (cU) • 

V = cU • V, therefore b is a bilinear function. 

(b) U • V = u\v\ + • • • + u n v n = vi u\ + • • • + v n u n = V • U shows that the dot product is 
commutative and b is symmetric. 

1.55 Three dot products are set equal to zero: 

w i + 2 w 2 - 2 w 5 = 0 
-2wi + W2 + 2w 3 = 0 
- 2 w 2 + W 3 + 2 w 5 = 0 

Since W 4 is not in the system W = (0,0,0,1,0) satisfies the equations. 

1.57 (b) and (c) only 

1.59 Add the identities (uk - v k ) 2 = u 2 - 2for k = 1,... ,n and recognize 

EJU u l = null 2 ’ ZU u kn = u • V and ZL1 ^ = IIVII 2 . 

1.61 

(a) (c,c,c,--- ,c) 

(b) nc 2 = 1 gives c = n~ 1 ^ 2 . 

(c) c = n ~ 1 ^ 2 tends to zero as n tends to infinity. 

1.63 Equations for a vector V orthogonal to Wi and W 2 are: 

v\ + v 2 + V 3 = 0 , v 2 + V 3 + V 4 = 0 . 

With any choice of v 2 and V3 these equations give v\ =V 4 = -v 2 - V3. Therefore 


V = (~V 2 -V3,V 2 ,V3,-V 2 -V3). 


Take for example v 2 and V 3 as 1,0 and 0,1. This gives independent vectors 
V = (- 1 , 1 , 0 ,- 1 ) and (- 1 , 0 , 1 ,- 1 ). 


1.65 

(a) C = (l-A,l,2), D = (l+M,2). 

(b) 2 h 

(c) The icosahedron is regular, so all edges have the same length, 2 h. 

(d) ||D - A|| 2 = (2/z) 2 , so (1 + h - 2) 2 + (1 - (1 - h)) 2 + (2 - 1) 2 = (h - 1) 2 + h 2 +1 = 4/z 2 , 
or h 2 + h - 1 = 0. Then h- ^(1 ± Vl +4), and it must be the plus sign since h is 
positive. 

1.67 

(a) By the triangle inequality \a\ = \a - b + b\ < \a - b\ + \b\ 

(b) Subtract \b\ from both sides in (a). That gives \a\-\b\<\a-b\. 

(c) Switch the numbers a and b in part (b) to get \b\ - \a\ <\a-b\. Combining that 
with part (b) gives \\a\ -\b\\<\a-b\. 
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(d) Mimic parts (a)-(c): ||X|| = ||X- Y + Y|| < ||X- Y|| + ||Y||, subtract and switch X 
and Y to get |||X|[ - ||Y||| < ||X - Y|| 

Section 1.6 


1.69 Denote U = (u\,U 2 ) etc. Then 
(a) det 


U\ + wl V\ 

U2 + W2 V2 

■ det [U Y 

CU\ V\ 

CU2 V2 

1.71 (a) 1 (b) -1 (c) 1 (d) -6 (e) -6 


(b) det 


= (u\ + W\ )V2 - (U2 + W2)vi = (u\V2~ U2V \) + (w\ V 2 ~ W2V \) 

+ det [W V], and similarly for det [U V + W]. 

= (cu\)v 2 -(cu 2 )v\ = cdet [U V] and similarly det [U cV] = cdet [UV]. 


1.73 

(a) In a permutation p\ •••p n + 1 , move the number n + 1 past k smaller numbers 
to its right; this can be done using k transpositions. Then the first n numbers 
remaining to the left of n + 1 are some permutation of 123 • • -n. Complete the 
argument inductively since none of the numbers 123 • • • n will need to be moved 
to the right of n + 1. 

(b) In 1237456 there are three cases where a larger number is to the left of a smaller 
one: 74, 75, 76. So ^(1237456) = -1. 

(c) In 1273456 there are four cases where a larger number is to the left of a smaller 
one: 73, 74, 75, 76. So ^(1273456) = 1. 

1.75 The signature of permutation p = p\P2'"Pn is the number s(p) that gives 

equality in 

~[(x Pi - x Pj ) = s(p ) P [(.Xj - Xj). 
i<j i<j 

The composite of p and q = gi #2 • • • q n can be expressed as 


P<l = PqiPq2---Pqn 


Denote x p = y qv that is, x Pk = yk for any k. Then 

- x p«) = n<y« ~ yq ^ = == 

i<j i<j i<j i<j i<j 


This shows that s(pq) = s(q)s(p). 


1.77 det(Ei,E 3 ,E 2 ) = det ooi 
from 123, so ^(132) = -1. 


= -1 and the permutation 132 is one transposition 


Section 1.7 


1.79 

(a) i(base)(height)= i||U||(||V||sin 0 ) 
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(b) yjl (uu||pr||) - Vl cos 2 0 - sin 0 

(c) i||U||(||V|| sin#) = i||U|| ||V|| yj 1 - ^ VllU|| 2 ||V|| 2 -(U-V ) 2 

(d) ||U|| 2 ||V || 2 - (U • V ) 2 = (m 2 + M 2 )(v 2 + V 2 ) - (Ml V! + M 2 V 2 ) 2 
= U 2 V 2 + U 2 V 2 ~ 2u\V\U 2V2 = (u\V2 ~ U2V\) 2 

1.81 Area is twice the area of the triangle ((0,0)(1,3)(2, 1 )), 
so it is 2(i)|(l)(l) - (3)(2)| = 5. 



Fig. 9.19 Figure for Problem 1.83. 



1.83 See Figure 9.19. 

The figure shows s(Y + W) = s(Y) + ^(W) for vectors on the same side of the line 
at left, and s(cY) = cs(\) for c > 0 at the right. 

Section 1.8 


1.85 (a) 


(b) 0 (c) -10 (d )bij 


1.87 The function M is defined by M(V) = viMi + • • • + v n M n where the are 
vectors. M is linear because: 


M(c\) = (cvi)Mi H - h ( cv n )M n = c(viMi H - h v n M n ) = cM(V), 


and 


A/(V + W) — (vi + wi)Mi + • • • + ( v n + Wn)\ 

= vjMi h -h v n M n + w\M\ h -h w n M n = M(Y) + M(W). 


1.89 (a) X • 





3 

-1 


1.91 N(MX) = NZjXjMEj = */N(ME ; -) and 

(NM)X = Yjj x j(NM)Ej. So we must show that N(ME ; -) = (NM)E ; -: 

(NM)Ej is the j -th column of NM that has i -th coordinate Yjh^h^hj- 

N(MEj) is N times the j-th column of M, that has the same i -th coordinate because 

the i -th row of N is (nn 

Therefore N(MX) = (NM)X. 
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1.93 

(a) By Property (v) of the determinant, since det A = 0 the columns of A are linearly 
dependent. By Theorem 1.20 there is a vector W not representable as AV. This 
proves (a). 

(b) For any vector U, BU is some vector and for any such vector, by part (a), 
A(BU) A W. This proves (b). 

By Theorem 1.20 the columns of AB are linearly dependent. By Property (v) then 
det(AB) = 0. 

Section 1.9 

1.95 

(a) x = 0 

(b) Take N = (0,1, l)x(-3,0,0) = (0,-3,3). The equation is -3y + 3z = 0. 

(c) Since the planes are parallel we can take the same normal N = (1,-3,5). The 
equation is (1,-3,5) • (X- (1,1,1)) = 0 or x — 3y + 5z = 3. 

1.97 All of them. 

1.99 All of them. 

1.101 

(a) X = (0, s, s) 

(b) X = (-3f,0,0) 

(c) X = (-3 t,s,s). 

1.103 Using equation (1.37), 

det[U V W] = u\ (V 2 W 3 - V 3 W 2 )- U 2 (viW 3 - V 3 W 1 ) u^(v\W 2 -V 2 W 1 ). 

Since det [U V W] = U • V x W we get 


V X W = (V2W3 - V3 VV2, ~(v\ W3 - V3W1 ), V] W2 - V2VV1 ) = (V2W3 - V3 VV2, V3W1 - V\ W3, V] W2 - V2VV1 ). 

1.105 

(a) (l,0,0)x(0,1,0) = ixj = k 

(b) j x(i + k) = -ixj + jxk = -k + i 

(c) (2i + 3k) x ( a\ + b} + ck) = 2M x j + 2d x k + 3ak x i + 3bk x j 
= 2bk - 2cj + 3aj - 3M = -3M + (3a - 2c)j + 2bk 
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<S> 



Fig. 9.20 Figure for Problem 1.107. 


1.107 See Figure 9.20. 



20 0 

(a) U • (V x W) = det 

02 0 = 

00 7 

-2 0 O' 

(b) U • (V x W) = det 

020 

007 
2 1 7] 

(c) U • (V x W) = det 

12 7 = 

007 



28. 

= -28. 

21. 
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Problems of Chapter 2 


Section 2.1 


2.1 

(a) (1,2) -X 

(b) (1,2,0) -X 

1 °1 v 
1 2 \ X 

100 lv 

1 2 0j X 

0 


(c) 


(d) 


(e) 


(f) 


(g) 


-1 0 1 
0 1 0 
1 00 
111 
0 
0 


-1 
5 
1 
1 
5 

0 -1 
-2 0 
0 1 
1 0 0 0 0 
5 0 000 
0-1000 
-2 0 000 
0 10 0 0 


2.3 The level set / = 0 is the origin together with all the points X with ||X|| > 1. The 
level sets f - c with 0 < c < 1 are (a) the two points ± a/c, (b) the sphere of radius 
Vc, and (c) the sphere in R 5 centered at the origin of radius yfc. All other level sets 
are empty. 


2.5 £(x,y,z) = 2x + 3y-z defines a function € : R 3 —> R for which £(x,y,z) = 0 gives 
the plane z = 2y + 3 y. Any nonzero multiple of 2v + 3 y — z would also work. 


2.7 

(a) Subtract ||A || 2 + 2A-U from each side of ||A + U || 2 = ||A || 2 + 2A-U + ||U|| 2 , giving 

l|A + U || 2 - (||A || 2 + 2A • U) = ||U|| 2 . 

Since the first term ||A + U || 2 is equal to /(X) and the second is g(X), this proves 
what we want. 

(b) We are assuming ||X - A|| < 10 -2 . That is, ||U|| < 10 -2 . Part (a) gives 
/(X) — g(X) = ||U|| 2 , so 


|/(X)-g(X)| = ||U || 2 < 10 -4 . 
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2 

2.9 Take x 2 -y 2 = c(x 2 +y 2 ) when y A 0 and c A 1. Divide by y 2 to get - 1 = 

c((p 2 -hi), from which This gives two lines (minus origin) v = ±{^y. 

Note that can be a^y nonzero. 

2.11 

(a) Level sets are the origin, unit circle, and circle of radius V2. 

(b) Level sets are the origin, unit circle, and circle of radius 2. 

(c) Level sets are empty, unit circle, and circle of radius . 

2.13 

(a) Level sets are spheres of radius -^=, 1, V2 centered at the origin. 

(b) Level sets are spheres of radius 1,2. 

(c) Level sets are spheres of radius V2, 1, -j =. 

(d) X = aU + V, /(X) = aU • U + U • V = a = c. So X = cU + V, V any vector 
orthogonal to U. This is a plane through c\J and orthogonal to U. 


2.15 According to Theorem 2.1, L(X) = 


*1 


Cl 1*1 "•-•" C\ n X n 

C2\X\ H- \-C2n%n 



Cml*l 4- 

r c n i 


c \n 

C 21 


c 2n 


+ • 

" + x n 


- c m\ - 


- c mn - 


. Since this is equal to 


the numbers multiplying x\ are the first column, X 2 the second, and so on. 
So L(X) = x\ V"i + • • • + x n \ n . 


2.17 We have /(X) = 1 and g(X) = \\X \\~ 2 for all X A 0. So the level set /(X) = 1 is 
R 3 - {0}, g(X) = 1 is the unit sphere in R 3 where ||X|| = 1, g(X) = 2 is the sphere of 
radius -^=, and g(X) = 4 is the sphere of radius \. There are no points where f = 2 
because f = 1 everywhere it is defined. 

2.19 

(a) F (t, 0 ) = (tcos 6 ,tsin 6 ). 

(b) The rectangle maps to a segment of the unit disk 

2.21 \\L(x,y,z)\\ = x 2 +z 2 +y 2 = 1, so L maps the sphere into itself. The right half¬ 
plane goes into the unit disk. The unit disk goes into the left half-plane. 


2.23 

(a) This uses two statements of Taylor’s Theorem 

m=m+f(e 1 )u, m=m+rm+y^u 2 

using the first, and then second derivative remainders. 
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(b) Apply part (a) with a = ||A|| and u = 2A • U + ||U|| 2 . 

(c) Reading from the first derivative Taylor formula we see Li(U) = ||A||“ 3 U, 

“large” = (- |(||A || 2 + 6>i)“ 5 /2 (2A + U) • U)(A + U) 
with norm less than some multiple of ||U||, and L 2 (U) = - 3j^jA, 

“small” = -§||A|r 5 ||U || 2 + f (||A || 2 + e 2 )“ 7/2 ( 2A • U + ||U|| 2 ) 2 (A + U), 

with norm less than some multiple of ||U|| 2 . 

Section 2.2 

2.25 

(a) Combine continuity of g and of F: given e > 0 there is 6 so that 
if ||F(B) - F(A)|| < 6 then |g(F(B)) - g(F(A))| < 6 

and there is 77 so that if ||B — A|| < 77 then |F(B) - F(A)| < 5. Therefore if 
IIB - A|| < 77 then |g(F(B)) - g(F(A))| < 6 . 

(b) Linear functions are continuous. 

(c) Combine (a) and (b). 

(d) Take g(x,y) = xy in (a). To show that g is continuous at (< a,b ), show that if 
\\(x,y)-(a,b)\\ < 6 then 

\xy - ab\ = \xy -xb + xb- ab\ < (\a\ + 5)6 + 5b < MS. 


2.27 

(a) 6/2 

(b) 3d 

(c) We know that \x - a\ < ||(x,y) - (a,b)\\ and \y - b\ < ||(x,y) - (a,b)\\. If 
||(x,y) - (a,b)\\ < m then 

| cos(2x) cos(3y) - cos(2a) cos(3d)| = |( cos(2x) - cos(2a)) cos(3 y) + cos(2 a)( cos(3_y) - cos(3d))| 

< | cos(2x) - cos(2a)| | cos(3y)| +1 cos(2a)| | cos(3_y) - cos(3d)| 

< | cos(2x) - cos(2a)| +1 cos(3j) - cos(3d)| <2 m + 3m. 

Take m = 6/5. 

2.29 

(a) True. Observe that 0 is between the values /(1,0) and /(0, \). Consider the 
continuous function of one variable g(t) = f(X(t)) where X(0 parametrizes the 
line segment from (1,0) to (0, |). By the Intermediate Value Theorem for con¬ 
tinuous functions of one variable, there is a t\ where g(t\) = 0 . 

(b) True. 

(c) False, that is, there are such functions / for which this does not hold. 

(d) True; this is part of being continuous at (0, |). 
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2.31 These are all compositions and products and sums of continuous functions. 
Therefore they are continuous. 

2.33 The graph is part of a plane in R 3 . The maximum value of / occurs, by 
inspection, at the boundary circle x 2 + x 2 = 2 where X 2 is as large as possible. This 
is the point {x\,X 2 ) = (0, V2), so the maximum value is V2. 

2.35 The inequality x 2 +y 2 < 1 confines (x,y) to the open unit disk, and places no 
restriction on z. Around any point of the unit disk you can center a small open disk 
contained in the unit disk. For each point (x,y,z) then there is a small open ball of 
that same small radius centered at ( x 9 y 9 z ) and contained in the cylinder. Therefore 
the cylinder is open. 

2.37 

(a) Boundary is the three edges v = 0, y = 0, and x + y = 1. 

(b) The point is .0001 from the left edge, and .0001 vertically thus .0001/ V2 > 
.00005 distance from the hypoteneuse. Take r = .00001. If Q is any point within 
distance r of (.0001, .9998) then Q is certainly inside a square 

.0001 - r < x < .0001 + r, .9998 - r < y < .9998 + r, 

that is, 

.00009 < v < .00011, .99979 < y < .99981. 

The top right point of that square is (.00011, .99981), still inside T because 

.00011 + .99981 = .999982 < 1. 

Since this little square is contained in 7 1 , a disk of radius r centered at 
(.0001, .9998) is also contained in T. 

2.39 

(a) c can be taken to be the matrix norm, Vl 2 + 5 2 + 5 2 + l 2 = V52, or any larger 
number. 

(b) d = c, using linearity of F. 

(c) Yes, by part (b): if ||X - Y|| < then ||F(X) -F(Y)|| < 6. 

2.41 

(a) If C = 0 the function is constant so is uniformly continuous. Otherwise: 

|/(X)-/(Y)| = |C-(X-Y)| < ||C||||X-Y||. 

Let e > 0. For any X and Y with ||X-Y|| < then |/(X)-/(Y)| < e. 

IIMI 

(b) One way: this is a polynomial in the variables {x\ 9 ... 9 x n 9 y\ 9 ... 9 y n ) 9 therefore 
continuous. Or we can prove directly that g is continuous at each point (U, Y): 


g((X, Y)) - g(U, V)) = X • Y - U • V = X • (Y - V) + (X - U) • V 
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has absolute value 


<||X||UY-V|| + ||X-U||HV|| 


If ||(X, Y) - (U, V)ll < S, then each of ||X-U|] and ||Y- V|| is less than 6 , and 
our estimate becomes 

|g((X,Y))-g(U,V))| < ||X||<5 h-< 5||V||. < (||U|| + d)d + d||V||. 

This is sufficient to show that g is continuous at (U, V), but since the estimate 
depends on (U, Y) we don’t get uniform continuity. 

2.43 Since ||X|| > 2 you have Similarly for Y. Therefore 

l|X|l 


1 

1 


l|Y||-||X|| 

M' 

‘M 


l|X||||Y|| 


<Ull|X||-||Y|||<i||X-Y|| 


2 2 

This shows uniform continuity. 

Section 2.3 

2.45 e runs from 0 to 7r, and r > 0, so your sketch ought to show a “rectangle” of 
height n that extends infinitely far to the right. 

2.47 (a) 0 < r < 1, any 6 . 

(b) 0 < r and 0 < 6 < |. 

2.49 (a) 1 = rr~ l = r(a + bcos6) = ar + bx = a a /* 2 +y 2 + bx. 

(b) Subtract bx and square: 1 -bx- a a/x 2 +y 2 , 1 - 2bx + b 2 x 2 = a 2 (x 2 + y 2 ) 

So 1 = ( a 2 - b 2 )x 2 + 2 bx + a 2 y 2 . Complete the square to get 


1 = (a 2 -b 2 )\x + 


b 


b 2 


Add 


a 2 -b 2 ) a 2 - b 2 
2 


2 2 

+ ay . 


i 2 -b 2 


2 2 a 2 — b 2 

t0 s et ^2 = (a 2 ~b 2 )(x + + cry 2 . Multiply by —— to get 




1 


(*+^2) 


(A) 

That is an equation for an ellipse. 

2.51 (a) (iii) (b)(i) (c) (ii) (d) (iv) 


( 1 ) 


2.53 

(a) 1 , spheres 

(b) 52(1,0,0) = 0 . 





















434 


Answers to selected problems 


(c) zero. 

(d) 3 cos 2 0 1 is zero where cos 0 = ± ^[T|3, a cone. 

(e) negative because the cos 2 0 is less than 1/3 there. 

2.55 

(a) In the expression (xu - vy,yu + vv), each of the two components is a continuous 
function of (x,y,u,v). So multiplication zw is a continuous function of z and w. 
In particular, the function z 2 is continuous, and then z 3 = z 2 z is a product of 
continuous functions so it is continuous, and so forth for all the powers of z. 
Also multiplying z k by pk is continuous. And the sum of continuous functions 
is continuous as we already know. Therefore polynomials are continuous. 

(b) Squaring doubles the angle 6 and squares the polar r: 

(r(cos# + isin 0)) 2 = r 2 (cos( 20 ) + isin( 2 #)). 


Therefore to find a square root of a complex number, take half the angle, and the 
square root of the absolute value. Similarly to find a cube root, divide the angle 
by three and take the cube root of the absolute value, etc. 

(c) \x + iy| = ^/x 2 +y 2 is the same as the norm in R 2 , that we know is continuous. 
When z = r(cos# + isin#) you have |z| = r, so \zw\ = \z\\w\ follows from 

n(cos0i +isin#i)r2(cos#2 + isin#2) = n^2(cos(#i +#2) + isin(#i + 62 )). 


(d) In the first part you can take P = \p n -i I + • • • + \po\- 

(e) We are assuming \p(z)\ is positive, and we know it is continuous and tends to 
infinity as |z| tends to infinity. Therefore / is continuous on all of R 2 and tends 
to zero as |z| tends to infinity. 

(f) By the product rule 

2- ^q(a) + q\a)(z - a) + q"(a) — ^ + • • • + q (n \a) ^—^ j 


= q'(a) + q"(a)(z - a)+ q'(a)(- 1 ) + q'"(a) — ^ + q"(a)( ~(z~ a )) 


2 ! 

Jn+lh^ iz-ay 1 , 


. 71-1 


■ + q {n+L) (a)- 


■ + q {n) (a)- 


(n- 1 )! 


Since q has degree n , q^ n+l \a) = 0, and all other terms cancel in pairs, so the 
derivative is zero. The contradiction follows from part (d), using a rather than z 
as the variable. Finally taking a- z evaluates the right hand side as q{a) since 
all other terms become 0 . 

(g) The contradiction arises because the terms indicated as • • • all involve higher 
than the k -th power of e. The sum of these terms is smaller than me k when e 
is sufficiently small, giving \p(z + eh)\ < m, contradicting part (e). 


2.57 The Cartesian coordinates of the points are 
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(sin 0 i cos#i,sin 0 i sin#i,cos 0 i), (sin 02 cos 62 , sin 02 sin 62 , cos 02 ), 

so the dot product is 

sin 0 i sin 02 (cos cos 62 + sin#i sin # 2 ) + cos 0 i cos 02 
= sin 0 i sin 02 cos (#2 - 0 \) + cos 0 i cos 02 

= sin 0i sin 02 cos (#2 - 6 \) + cos (02 - 0 i) - sin 0 i sin 02 = cos (02 - 0 i) - sin 02 sin 0 i (1 - cos ($2 - 6 \)). 
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Problems of Chapter 3 
Section 3.1 

3.1 (a) 6x, (b) 4. These are the x and y partial derivatives of 3x 2 + 4 y, 

3.3 

(a) /,(*,>>) = -lxe~ x2 ~ y2 , f y { 2 , 0 ) = 0 , 

(b) xe y + c x 

(c) -y sin(xy) + cos(xy) - xy sin(xy) 

3.5 


(x,y) 

(. 1 ,- 2 ) 

(. 01 ,. 02 ) 

(1 +x + 3y) 2 

2.89 

1.1449 

1 +x+3y 

1.7 

1.07 

1 + 2 x + 6y 

2.4 

1.14 


3.7 Level sets of m are closer, and Vm is longer. 

3.9 

(a) Any linear function is given by some matrix. 

(b) A vector tends to zero if and only if each component tends to zero. 

(c) With these substitutions the fraction becomes the definition of partial derivative. 
The fraction tends to f^ Xj by definition. And the j-th component of row Q is the 
(/,/) entry of the matrix C. 

3.11 V/(x, y) = (- sin(x+y),~ sin(x+>•)) so f x -f y = - sinCv + v) - (- sinU +>’)) = 0. 
Wg(x,y) = (2 cos( 2 v - v), - cos( 2 v- y)) so 
g x + 2 g y = 2 cos( 2 x - y) + 2 (- cos( 2 x - y)) = 0 . 

3.13 Vg(x,y) = (ae ax+by ,be ax+by ) = e ax+by (a,b) 

Section 3.2 
3.15 

(a) z = f(0,0 ) + f x (0,0)x + f y (0,0)v = x 

(b) z = f(h2) + Ml2Xx- 1) + fy(l,2)(y - 2) = 5 + (x- 1) + 4(j - 2) 

3.17 

(a) If a 2 + b 2 = 1 then 


f(a,b) = e - (fl2+fo2) = e _l , g(a,b) = e _1 
(b) Vf(x,y) = e~ <x2+y2> (-2x, -2y) and Vg(x,y) = -e “ 1 (x 2 + _v 2 ) _ 2 (2.v,2\ ! ) 
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(c) At point (< a , b) where a 2 + b 2 = 1 you have by part (b) 
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Vf(a,b) = z~\-2a,-2b\ Vg(a,b) = -e~ l (2a,2b) 

and f(a , b) = g(a , &) by part (a). Since the values and gradients of / and g agree, 
the graphs are tangent. 

Section 3.3 

3.19 

(a) fxx — 2 , fxy= 0, fyy = -2 

(b) fxx - 2, Ay = 0, /^ = 2 

(c) Ax = 2, f xy = 2, Ay = 2 

(d) Ax = e _x cosy, Ay = -e“ x siny, Ay = -e“ x cosy 

(e) f xx = -a 2 e~ ay sin(ax), fxy - -a 2 o~ ay cos(ax), f yy = a 2 o~ ay sin(av) 

3.21 (a) (b) and (c) only 

3.23 V/(x,y) = (3 +y,2 + x), V/( 1,1) = (4,3). The directional derivatives are 


V/-V = 4, 5, 3, 0, -4. 


The second one, the largest, is equal to ||V/(1,1)|| because V is in the direction of 
V/(l, 1). 

3.25 

(a) If u(x, t) = f(x - 3t) then according to the Chain Rule, 
u x = f'(x - 3 1), u t = f\x- 30(-3), so u t + 3 u x = 0. 

(b) Dyu = V- Vm = -^L(3,1)• (u x ,u t ) = -X=(3k* + h,) = 0. 

(c) The slope of a line (v horizontal, £ vertical) parallel to V is 1/3, as is the line 
that goes through (x-3t, 0) and (x, t). 

(d) Since u(x,t ) is constant on the line through (x- 3^,0), it follows that u(x,t) = 
u(x-3t,0). This shows that every solution is a function of x-31. 

3.27 

(a) (x 2 +y 2 ) 3 = r 6 , and ((v 2 +y 2 ) 3 ) x = 6v(v 2 +y 2 ) 2 , similarly for y. Then 
x(6x(x 2 +y 2 ) 2 )+y(6y(v 2 +y 2 )) = (6v 2 + 6y 2 )(v 2 +y 2 ) 2 = 6r 2 r 4 = 6r 5 r = r^-(r 6 ). 

(b) r = *sjx 2 +y 2 , so if f(x,y) = g(r) then f x = g'(r)r x = g'(r)and similarly for A- 


Then 
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3.29 Differentiate the change of variables v = rcos#, y = rsin# to get 

x r = cos$, y r = sin6, xe = -rsin6, yo = rcos6. 

The Chain Rule gives for any differentiable function v of v and y 

v r =v x cos# + VySinO 
vq =v x (-rsin6) + v y rcos6 

Multiply the first by r and use the change of variables again to get 


rv r =xv x + yv y 

V6=-yV X +XVy 

The formula for v r applied to u gives u r = z/ x cos# + u y sin0, and the r partial of this 
is 

u rr = u rx cos 6 + Uyy sin 6. 

The formula for v r applied to u x and u y gives 

u rr = ( u xx cos 6 + Uxy sin 6) cos 6 + ( u yx cos 6 + u yy sin 6) sin 6. 

The same process gives 

uqq = -rcos6u x -yu X Q-rsin6u y + xUyQ 

= -xu x - yu y - y(-yu xx + xu yx ) + x(-yu yx + xu yy ). 

This gives uqq = -ru r + y 2 (u xx + u yy ). Divide by r 2 and add u rr to get 

1 1 

— Uqq + U rr =- U r + Au, 

r z r 


or Au = u rr + \u r + -jsUQQ. 

3.31 

(a) u x = 2x, v y -2x- u x , u y = -2 y, v x = 2y = —u y 

(b) U XX + Uyy = Vy X + ( _ Vyy) = 0 

(c) w(x,y) = u 2 -v 2 = (x 2 -y 2 ) 2 - (2xy) 2 = x 4 - 6x 2 y 2 +y 4 , 
so + w yy = I2x 2 - 12y 2 - I2x 2 + 12y 2 = 0. 

(d) w(x 9 y) = r(p(x 9 y) 9 q(x 9 y)) gives = r x p x + r y q x and w y = r x p y + r y q y . Then 

V^XX — (?XX Px + V X yCj X )p x + V X P XX + (r yx p x + Y yyC[ X )C[ X + Y yCj XX 


and 


Wyy = iTxxPy + r xy q y )p y + Y x Pyy + (r yx p y + Yy y qy)q y + r y q y y. 


Therefore 
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Wxx + Wyy — (p x + Py)rxx + (2-CfxPx + 2p y q y )rxy + (P x Qy) r yy r x(Pxx + Pyy) + r y(plxx + #yy)- 

Since = q y and /?y = -q x the last expression is 

M-’xt + Wyy = (P X + Py)(r X X + Tyy) + r x (p xx + Pyy) + r y (q xx + qyy). 

But p xx + pyy, q xx + q yy and r xx + r yy are all zero, so + w yy = 0. 

3.33 

(a) (x + iy ) 2 = x 2 - y 2 + 2 ixy so u = x 2 -y 2 and v = 2 xy. 

(b) (z + h) 2 =z 2 + 2zh + h 2 . The term 2 zh is a complex multiple of h , and /t 2 is small 

/t 2 9 

in the sense that — = h tends to zero when h does. Therefore (z ) = 2 z. 
h 

(c) If h is real then z + h = (x + h) + iy and by Taylor’s Theorem applied to u and v 
you have 

f(z + h) = u(x + h,y) + iv(x + h,y) = u(x,y ) + u x h + i(v(x,y) + v x h ) + s 

where s is small. That says f(z + h) = f(z ) + ( u x + iv x )/z + s, so /' = u x + iv x . 

(d) Take h = ik pure imaginary. Then 

/(z + /i) = w(v,y + /:) + iv(v,y + k) = u(x,y) + ku y + i(v(v,y) + Vyk) + s 

= f(z) + (My + i Vy)k +s = f(z) + (u y + iVy)(-i/0 + s. 

Therefore /' = -it/y + v y . 

(e) /' = u x + iv x = -i u y + Vy gives u x - v y and u y = -v x . 

(f) If C then lixx — (Vy)x — Vxy — ( £Ty)y SO llxx 4" Uyy — 0, 

similarly Av = 0. For z 2 you have A(x 2 -y 2 ) = 2-2 = 0, A(2xy) = 0 + 0, 
and for z 3 = v 3 + 3v 2 iy - 3xy 2 - iy 3 you have A(x 3 - 3 xy 2 ) = 6x + (~6x) = 0 and 
zf(3* 2 y-y 3 ) = 6 y- 6 y = 0. 

Section 3.4 


3.35 

(a) £>F = 


e^cosy -e"smy 
e x siny e x cosy 

2x 


(b) detDFOr, v) = e Zx , delDF(a,b) = e 2 " is not zero for any a. 

(c) F (x,y + 2n) = F(x,y) because of the sine and cosine. 

3.37 

(a) (-l ) 4 + 2 (-l)(l)+ 1=0 and 1 = 1 

(b) Z)F = [ 4x3+2 J 2 ^ 

(c) detZ)F(-l, 1) = det 
7-1 


-2 ■ 
0 


-2 A0 


(d) DF" 1 (0,1) = DF(-1,1 )“ 1 = 


4 - 1 

0 1 
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F -1 (w,v) w F _1 (0,1) + 


-5 "I' 

u - 0 

0 1 

v — 1 


gives 


F _1 (.2,1.01) ~(-l,l) + 


-2 " I ' 

' .2 

0 1 

.01 


= (- 1 . 11 , 1 . 01 ). 


3.39 

(a) No, since f z ( 1,2,-3) = 0. 

(b) Yes, since f y (l,0,3) A 0. 

(c) Yes, since /y(l,2,3) A 0. 

(d) No, because the y values are different: 


0 = gpart(b)(l,3) A 2 = g par t(c)(l,3). 


(e) No, because f x ( 1,0,3) = 0. 

3.41 

(a) 3(0) + (4) 2 + 4(-4) = 0, 4(0) 3 +4 + (-4) = 0. 

(b) A' 1 { l v - = . ? 2 2 ' i and at (>’i ,>’2) = (0,4) this is 3 * 

hy\ J2y 2 \ [12^i 1J [O f 

there is a function G. 

(c) From fi{x,gi{x),g 2 {x)) = 0, f 2 {x,g\{x),g 2 {x)) = 0 we get 


that is invertible. So 


fix 

_l_ 

fh\ 

hyi 

A 


0 

fix 


hyx 

hy 2 _ 

A. 


0 


At v = -4 this gives 


4 

-i_ 

3 8 

s[ (-4) 


0 

1 


0 1 

g' 2 (~4) 


0 


Vi(-4)' 


3 8 

-l 

4 


r 1 8 i 

3 3 

4 


r 4] 

3 



0 1 


1 


0 1 

1 


-1 


3.43 

< a >[» w] 

(b) The determinant is not zero when y\ A 0. 


(c) y = ± Vl -x 2 , 


ux _ — ux 

y vft? 


(d) x = ± Vl-J 


M=-S= T - 




Section 3.5 


3.45 

(a) div (3F + 4G)( 1,2, -3) = 3div F( 1,2, -3) + 4div G( 1,2, -3) 

= 3(6) + 4(l 2 -2(-3)) = 46 

(b) div curl F(x,j,z) = div(5y + 7z,3x,0) = £(5y + 7z)+ |(3x)+ |(0) = 0 
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(c) curl (3F + 4G)(1,2, -3) = 3curlF(1,2, -3) + 4curl G(1,2, -3) 

= 3(5(2) + 7(-3), 3(1), 0) + 4(5,7,9) = (-13,37,36) 

3.47 The vectors H( 1,0) = (0,1), H(0,1) = (-1,0), H(-1,0) = (0, -1), and H(0, -1) = 
(1,0) suggest counterclockwise rotation. But: 

cur iH = (- T V)-(- T V) 

\(x 2 +y 2 )P J x \(x 2 +y 2 )P Jy 

1 2 * i 1 2 y 

(x 2 -\-y 2 )P ^ (x 2 -\-y 2 )P +l ( x 2 +y 2 )P ^(x 2 +y 2 )P +l 

_ (.x 2 + y 2 ) - 2 px 2 + (x 2 +y 2 ) - 2 py 2 (2 - 2p) 

(x 2 + y 2 ) p+1 (x 2 + y 2 )P 

The sign depends on p : negative when p = 1.05 or any p > 1, positive when p = .95 

or any p < 1, and the curl is zero when p = 1. 

3.49 

(a) = (Pl> ! — v)x ~ Plyx ~ Vx = .Pixy — — Uy — V x — 0 

(b) P2* = Plx = M, = V + W-C y = V 

(c) ra x = (P2z - w)jc = P2xz -w x = u z -w x = 0 and m >; = (p 2z - w) y = p 2yz -w y = 

V Z ~Wy = 0. 

(d) P3z = P2z~% =w + m-m = w 

3.51 

(a) The Chain Rule gives X" = (DV)X' + V, = V, + (DV)V. 

(b) Since X(f) = Ci +r 1 C 2 , V(X(r),0) = X'(f) = -r 2 C 2 and X" = 2r 3 C 2 ). 

Since V(X) = r'iC, -X), you have V, = -r 2 (Ci -X) and D\ = -r 1 /. Then 

v ; (X(r), o) + ov(X(r), t))v(x(t), o) = -r 2 (Ci - x) - r'v(X(o, o) 

= -r 2 (C! -(Cj +r 1 c 2 ))-r 1 (-r 2 c 2 ) = -2r 3 c 2 

and this is equal to X"(X(t),t). 

3.53 The first term is, by the Mean Value Theorem, 

~{f{x 0 +Ax,y m ,Zm) ~ 0 )AyAz = (- fx(x,y m ,Zm), o, 0 )AxAyAz 

for some point 2 between zo and zo +2lz- Similarly for the other two terms. 

3.55 For each k = 1,2,3, the product rule gives (uv) XkXk = u XkXk v + 2u Xj v Xk + uv X]cXk . 
Sum over k. 

3.57 Begin with the right side. First calculate, writing terms in cyclic 123123 order, 
(divG)F + G-VF 
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= ((glxi + §2x2 + 83x 3 )/l +(gl/ljq + 82/lx 2 +^ 3 / 1 x 3 ), 

(g2x 2 + #3x 3 + glxi )/2 + (82/2x2 + 83/2x3 + gl/Xq X 

(#3x 3 +&lxi + 82x 2 )/3 + (^ 3 / 3 x 3 +g l/3xi + ^2/3 x 2 )) 

Interchange F and G and subtract to get 

(div G)F + G VF - ((div F)G + F VG) 

= (82 x 2 /i -hx 2 8l +82/U2 -/28 \x 2 + 83x3/1 -/3x 3 8l +^3/1x3 ~/38lx3> 
83x3/2-/3x382 + 83/2x3 - h82x 3 + 8\x x h~ hx x 82 + 8\hx x — Z\82x\ ? 
83x3/2 — /3X382 + 82/2x3 — Z282X3 +81x1/2 — Zlx x 82 +8 1/2x1 — Zl82x x ) 

Group terms using the product rule: 

= ((/l82-/28l)x 2 -(/38l-/\83 )x 3 , 

(Z283 - /382)x 3 - (Z182 - /2gl)xi, 

(/381 -/i83)x l -(/283-/382)x 2 ) = curl(FxG). 

3.59 

(a) Identity (g): 

curl curl F 

= ((/2x\ ~ /lx 2 )x2 — C/1.X3 ~ /3 xi)x 3? 

(/ 3 x 2 ~ X1X3 )x 3 — C/Zxi — /lx2)xi ? 

C/IX3 — /\x\ )xi — (/ 3 x 2 ~ Z 2 X 3 )x 2 ) 

= (/2x\x 2 ~ Zlx 2 X2 ~ Z1X3X3 + Z 3 x\X 3 9 

/3x 2 X3 ~ Z2X3X3 ~ /2x\x\ + /lx2Xi9 

/lx 3 xi — /3xixi ~ /3x 2 X2 +/2X3X2) 

+(/lxixi — /lx]X] 9 /2x 2 X2 ~ /2x 2 x 2 9 Z2X3X3 ~ /3X3X3) 

= V(divF)-zlF 

(b) Identity (h): 

^(/g) = (/^)xixi + (/g)x 2 x 2 + (/ 8 ) X 3 X 3 

— Zx\X\8 + 2 fx x g Xl + fgx lX] + ■■■= gdf + 2 V/ • Vg + /zfg. 

(c) Identity (i): By Identity (d) and curl Vg = 0 we have curl (/Vg) = V/x Vg. 
Since V/ x Vg is a curl and div curl H = 0 we get div (V/ x Vg) = 0. 
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Problems of Chapter 4 

Section 4.1 
4.1 

(a) f x = -x(x 2 +y 2 + z 2 ) _3/2 , f xx = -(x 2 +y 2 + z 2 ) _3/2 + 3x 2 (x 2 +y 2 + z 2 )~ 5/2 and by 
symmetry f zz = -(x 2 +y 2 + z 2 ) _3/2 + 3z 2 (x 2 +y 2 + z 2 )~ 5/2 

(b) 0 

(c) 0 

(d) h Xj = dj is constant, so second derivatives h XjXk are 0 

4.3 f x (x,y ) = \{x 2 + y 2 ) 1/2 ( 2x) = 3x(x 2 + y 2 ) 1/2 , and when (x,y) A (0,0) 

fxy(x,y) = lx(x 2 +y 2 r 1/2 ( 2y) = 3^(x 2 +/r 1/2 . 


4.5 

(a) ^ = ae ax+l, y +cz+dw = ag 

(b) (P'd 2 Q ax+ by +cz+ d w yi c 2 Q ax+by+cz+dw 

(c) a 2 d 2 + b 2 c 2 - 2 abed = {ad - be) 2 = 0 so ad = be. 

(d) a 2 d 2 + b 2 c 2 -2 = 0 . 

4.7 U, = n r l- n /2 e -\\Xf,4t + r n/2 e -||X|| 2 /4. = 

2 4r 2 2 4^ 2 

Xj 1 X 7\2 

Similarly = -—m, Ux i x i = ~2f U + ^~Yr U ’ S ° 

1 v 2 + • • • + v 2 

Mjcud +-” + ^JC n =W(-—)M+-^- U = Ut - 

4.9 Differentiate r 2 = x 2 H-h x 2 to get r X]c = —. Therefore (r p ) Xk - pr p ~ 2 x^. Then 

by the product rule (r p ) X]cXk = p(p - 2)r p ~ 4 XkXk + pr p ~ 2 . Sum over k to get 

A(r p ) = p(p - 2)r p ~ 4 r 2 + npr p ~ 2 = p(p -2 + n)r p ~ 2 . 


This is zero if p = 2-n. 

4.11 

(a) ((v 2 +y 2 + z 2 ) 1/2 ) x = v(v 2 +y 2 + z 2 ) _1/2 and 

((v 2 +y 2 +z 2 ) 1/2 ) xx = (x 2 +y 2 +z 2 ) _1/2 - v 2 (v 2 +y 2 +z 2 )“ 3/2 . 
So by symmetry zl((v 2 +y 2 +z 2 ) ly/2 ) = (3 - l)(x 2 +y 2 + z 2 ) -1 / 2 . 
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(b) u x = v r r x -v r ~ r and u xx = v rr ^ + r l v r -v r ^j‘, similarly for u yy and u zz . Then 

x 2 +y 2 + z 2 _! x 2 +y 2 +z 2 „ _i 

Au = v rr ---l-3r v r -v r --- =v rr + 2 r v r . 

r z r 5 

(c) Using parts (a) and (b), 

AAr = A(2r~ l ) = 2((-l)(-2)r“ 3 +2r“ 1 (-l)r“ 2 ) = 2(2-2)r“ 3 = 0. 

Section 4.2 


4.13 


(a) V fix, v) = (-2 + 2x, 4 - 8v + 3 y 2 ) = (- 2 + 2 x,(y- 2)(3 y - 2)) is zero at (1,2) and 
at (1, |). 

(b ) <H(x,y)=l Ql ° W(lJ)=n ?,W(1,2) = 


2 0 
04 


(c) 7/(1,2) is positive definite because U • *7/(1,2)U = 2w 2 + 4v 2 is positive except at 
the origin. *7/(1, |) is indefinite because 2u 2 -4v 2 has both positive and negative 
values. Therefore / has a local minimum at (1,2) and a saddle at (1,1). 


4.15 

(a) 3x 2 + 4 vi ^2 + x 2 = X • 


3 2 
2 1 


(b) -x 2 + 5x 1 x 2 + 3x 2 = X-I -2 


- 3 
2 J 

4.17 V/ = (~3x 2 + 2x + y, x + 6y). The first derivatives at (0,0) are zero, and 

fxx(0,0) /xy(0,0)l _ r 2 1] 

/yx(0,0) fyy( 0,0) J — L l 6 J 

is positive definite. Therefore /(0,0) is a local minimum. 

4.19 Complete the square: x 2 + qxy + y 2 = (x+ \qyf + (l - ^)y 2 . That is positive 
definite if \q\ < 2. 


4.21 At (0,0), 
forms are 


fxx fxy 


2 0 


§xx §xy 


0 o' 

fxy fyy 


04 

’ 

§xy Syy 


04 


. The corresponding quadratic 


S f(u,v ) = 2u 2 + 4v 2 


which is positive definite, and S g (u, v) = 4v 2 which is not because the values S g (u, 0) 
are not positive. 

4.23 The closest point is (- 1, ^). This is also a normal vector to the plane because 

the equation for the plane can be written as 


-\x + y+\z=\. 
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A sketch makes clear that the distance to the origin is minimum when that distance 
is measured along a line normal to the plane. 

Section 4.3 

4.25 

(a) V/(v,y,z) = (,log(l +x)e J ,cosz) is nowhere the zero vector, so there are no 
local extrema. 

(b) /(0,0,0) = 0, V/(0,0,0) = (1,0,1), and 



r 

(l+x) 2 

1+Jt 

0 


-1 1 0 

*Hf(x,y,z ) = 

1+x 

log(l + 

0 

, <Hf( 0,0,0) = 

1 00 


0 

0 

-sinz 


000 


so 


/72(H) = /(0,0,0) + V/( 0,0,0) • H + ±H r 7T/(0,0,0)H = h x + h 3 + \ (~h\ + 2hih 2 ). 

4.27 

(a) V/(x,y,z) = (2x+y,x + 2z+y,2y + 3z 2 ). This is (0,0,0) when 


y = -2x, z = -^x + x, 2(-2x) + 3{\x 2 ) = 0, 


(b) 


giving (x,y,z) = (0,0,0) and (x,y,z) = A = (y,-y, §)• 


<Hf(x,y,z) 


2 10 1 
1 1 2 
0 2 6z 


<w/(0,0,0) 


2 i o 
1 1 2 
0 2 0 


<H/(A) = 


2 1 0 
1 1 2 
0 2 16 


For < K/( A) you have determinants 2 > 0, 1 > 0, -2(4) + (16) = 8>0so /(A) is 
a local minimum. 

(c) For 7T/(0) the determinants are 2 > 0, 1 > 0, and -8 < 0 so the determinant test 
for positive definiteness doesn’t help. But /(0,0,z) = z 3 has the same sign as z 
so /(0,0,0) is a saddle. 


4.29 

(a) /(X) = (x 2 + • • •) -1/2 so f Xj (X) = - j(x 2 + • • • T 3I2 2xj = -(x 2 + • • • )“ 3/2 x ; - and 
fxjxtiX) = 3(xj + • • • )~ 5,2 x k xj-(xf + ■ ■ • )~ 3/2 ^ = 3(xj + • • • T 5,2 x k xj - (x 2 + • • • ) -3/2 fyt 


where ^ is 1 if j = k else 0. 

(b) /(A) = HAir 1 and from part (a), /..(A) = ||A||- 3 « 7 - and 


4 x,(A) = 3||A|r 5 a*aj - ||A|r 3 fy = ||A|r 5 (3a*a ; - - ||A|| 2 fyt)- 


So the Taylor approximation is 
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IIA + Hir 1 «p 2 (A + H) Bf jlAir 1 + ||A||- 3 ^aA+ 5ll A ll" 5 Yj (( 3a kaj-\\M\ 2 S jk ))hjh k 

i= i i,k= l 

= ||A||- 1 + ||A||- 3 A • H + i ||A||- 5 (3(A • H) 2 - ||A|| 2 ||H|| 2 ). 

4.31 pi(X) = x\ + X 2 + *3 + X 4 = P 2 (X) = P 3 (X) and 774 (X) = x\ + X 2 + X 3 + X 4 + 
X 1 X 2 X 3 X 4 = /? 5 (X). 

Section 4.4 

4.33 V(3x 2 + 2xy + 3y 2 ) = AV(x 2 +y 2 ) gives ( 6x+2y,2x+6y ) = A(2x,2y). So we have 
to solve 

(6-2T)v + 2y = 0, 2x + (6-2A)y = 0, x 2 +y 2 = 1. 

We get 3 ; = (-3 + 4)v, v + (3 - A)(-3 + 4)x = 0, so x = 0 or 1 - (A - 3 ) 2 = 0. So A = 2 
or 4. Using 4, y = x and <2(^=, -^=) = (3 + 2 + 3)\ = 4 = Q(=±, =±). That is the 

maximum because using 4 = 2 you findy = -x and j2(^, — -^) = (3 — 2 + 3)j = 2 = 

^ VI’TA 

4.35 We solve V(||X - A|| 2 ) = AV(C • X) with C • X = 0, or 


2(X - A) = AC, C-X = 0. 


This gives 
Therefore 


A - - 

and the minimum distance is 


X = A + 5 AC, C-X = C-A + jA||C || 2 = 0. 

2C A 


lien 2 ’ 
|C-A| 
IIC|| ' 


1 2 

X 

2 -2 

y 


x = a-^c, 

IICII 2 


- .2 


4.37 We maximize f(x,y ) = [x y] 
x 2 +y 2 = 1. We solve V/ = AVg or 

(2x + 4 y, 4x - 4 y) = A( 2x, 2 y). 
Note this says that 2AX = T2X. Dividing by 2, these give 

(1 -A)x-\-2y = 0 
2x - (2 + A)y = 0. 

Either ( x,y ) = (0,0) or the determinant 


x z + 4xy - 2y z subject to g(v,y) = 


(1 - A)(-2 - A) - 4 = 0. 
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We get A = 2 or -3. Therefore 2 is the maximum, and it occurs where 


-x + 2 y = 0 . 

This gives x-2 y with x 2 + y 2 = 5 y 2 = 1, or 



r 2 1 


2 1 

A 

f 

= 2 

V! 

l 


L V5 J 


VsJ 


4.39 Let f(x,y,z ) = (x + l ) 2 +y 2 +z 2 and g(x,y,z ) = x 3 +y 2 +z 2 = 1. Find solutions 
to V/ = AWg and x 3 + y 2 + z 2 = 1. 

2(x + 1) = T(3x 2 ), 2y = T(2y), 2 z = A(2 z). 

Case 1: If y or z is not 0 then T = 1 and x = 2± V4-4(3)( z 2) _ i±_V7 

Since y 2 + z 2 = 1 - (yy ^) 3 is negative, the plus sign is not possible. Therefore only 

x = yy is possible. Then 


2 2 
y+zr : 


,-, 1 ^ 2 )’: 


l + (4±)\ 


= (1 + = (1 + +1 -n^ 1 ) 3 < <§ f * 1+(i) 3 < 3- 

Case 2: If y = 0 and z = 0 then x 3 = 1 and x = 1 and /(1,0,0) = (1 + l ) 2 = 4. 

Therefore /(x,y,z) = (x+ l ) 2 +y 2 +z 2 subject to x 3 + y 2 + z 2 = 1 has a minimum 
on the circle of points 


x = 


1-V7 
3 ’ 


y 2 +z 2 = 1 


+ ( 


V7-U 3 

3 ) ' 


Problems of Chapter 5 

Section 5.1 
5.1 

(a) velocity X'(0 = 0 , acceleration X"(0 = 0 , speed ||X'(f)|| = 0 

(b) (1,1,1), 0 , V3 

(c) (-1,-1,1), 0 , V3 

(d) (1,2,3), 0, Vl4 

(e) (l,2t,3t 2 ), (0,2,6?), Vl+4? 2 + 90 

5.3 

(a) V(0 = X'(0 = roj( - sin(o>0, cos(iot)) so V(0 • X(0 = 0 

(b) ||V(0ll = w j|(-smM),cosM))|| 


1 
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(c) X"(t) = -rco 2 (cos(cot),sin(cot)) is a negative multiple of X(4), so is toward the 
origin. 

(d) ||X"(0II = roj 2 \\(cos(ojt),sm(ojt))\\ 

-N/--'' 

1 

5.5 Differentiate X • X ^constant to find X' • X = 0. Since X(t) is on the sphere, this 
means X'(0 is tangent. 

5.7 X(t) = A + B^ + with constant F gives X'(f) = B + ^Ft and X"(t) = so 
F = mX". X(0) = A, m X'(0) = B. 


5.9 

(a) X(t) is a multiple of A, so the motion is along the line through the origin 
containing A (or remains at 0 if A = 0). Many planes contain the line. 

(b) Same as (a). 

(c) X(t) is a linear combination of A and B so lies in the plane through the origin 
containing A and B. If it happens that A and B are linearly dependent, then the 
motion is along a line (or remains at 0 ) and there are many planes containing it. 

5.11 X(0) = C, X'(t) = e"*'D, X'(0) = D, X"(t) = -ke~ kt D = -kX'(t). As t increases 

from 0 to infinity, X(t) goes from C to C + |D, so the particle is displaced ^X'(O). 

5.13 

(a) Vx ( 0 , 0 , b) - (vii +vj + V 3 k)x(/?k) = -bv\} + bv 2 \ - (bv 2 ,~bv\, 0 ) 

(b) If X(£) = (asin(ojt),acos(ojt),bt) then 

V {t) = X' {t) = (aoj cos (cot), -aco sin(ct;0 ? b), V' (0 = X" (t) = {-aco 2 sin(a;f), -aco 2 cos (cot), 0). 
and 

(. bv 2 (t),-bv\(t), 0 ) = (-baojsm(ojt),-baojcos(ojt),0) 
so V' = V x B as required. 

(c) The force and acceleration are horizontal and point toward the *3 axis. 

Section 5.2 

5.15 

(a) From (x(t),y(t)) = (a cos cot, a sin cot) we get 


2 2 2 2 2-2 2 

v + y -a cos cot + a sin cot = a 


so r = a. 

(b) From (v(0,y(0) = (0 cos 6^,0 sin cuf) we get 

(x , y) = (-aco sin cot, aco cos cot), (x", y") = (-aco 2 cos cot, -aco 2 sin cot). 

Observe that (x",y") = -co 2 (x,y). Using that with part (a) we get 

2 , -3\ v rr , y _ ( 2 , -3 


X + 


(x 2 +y 2 ) 3 / 2 


■ ( — co -ha )x, y + 


(x 2 +y 2 ) 3 / 2 


■ (-co 2 -ha 3 )y, 
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These are zero if to 2 a 3 = 1. 
(c) The constant A is defined as 


A = xy' -yx - (a cos cot)(aco cos cot) - (<a sin cot)( - aco sin cot) - a 2 co( cos 2 cot + sin 2 cot) - a 2 co. 


5.17 

(a) The error is small compared to h by definition of derivative. 

(b) See Problem 1.80. 

(c) This is direct from equation (5.1 1). 

(d) The rate of change of area is \A area/time, and T is the time required for one 
loop, so the area must be \AT . 

5.19 We are assuming 


/" + 


/ 


(/ 2 + ^2)3/2 


= 0 , 


*" + 


(/ 2 + g 2 )3/2 


= 0 . 


Replacing / by -/ has the same effect as multiplying the first equation by -1, so / 
can be replaced by -/. The original value of xy' -yx' = fg' - gf'. 

The new value is xy' - yx' = (-f)g' - g(-f') = ~(fg' ~ gf'), that is positive if the 
original value is negative. 


5.21 

(a) -pr is positive so q - y must be positive. 

q 

(b) On the y axis, r = |y|, so -p\y\ = q-y has a positive solution y = - - and a 

1 -P 

q 

negative solution y = - -. But since the orbit is left-right symmetric (due to 

1 +p 

the x 2 ), it can only be an ellipse. 

a a pa 

(c) One axis has length ----= 2--. 

1 -p 1 +p 1 -p 1 

The other must be at the middle where y = i ( ^ ^ and there 

2 \i -p i +p) i -p 2 


q 

—pr — q~y — q~ - -y 

1 -p z 



So at the middle 


Therefore the semiminor axis is 




> 2 q 2 -q 2 

(1 - P 2 ) 2 


V^ 2 - 1 


, the semimajor 


-pq 


V^ 2 - 1 

(d) Using (5.18) and (5.22) we get A = therefore 
= /t(semimajoraxis) l,/2 . 


p 2 -r 

q yq 


y p 2 - \ yp - 1 


a y~p 
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(e) Therefore \AT - 7 r(semiminor)(semimajor) = 7 rA(semimajor axis) 3/2 . 

5.23 

(a) Since V||X|| = HXIp'X, the Chain Rule gives V(±||X|r 4 ) = -||X||- 5 V||X|| = 

-nxir 6 x. 

(b) The energy jHX'H 2 - |||X|| 4 is constant; since X' = (-a sin 0, a cos 0)0' it is 


ja 2 (0') 2 - 4 ((1 + cos 0) 2 + sin 2 #) 2 = ja 2 (0') 2 - 4 (2 + 2cos 0) 2 . 

1 , 2 1 2 1 1 

(c) Using (b), -a k - —— = 0 so k = —r, or k= — 

2 16a 4 8a 6 a 3 V8 

(d) Your sketch will show that f(0) = 6 + sin 6 is an increasing function of 6 on [ 0 , 7 r], 
so it has an inverse. Then solve 6 + sin# = kt by setting 6{t) - f~ l {kt). 

(e) This motion follows a semicircle into the origin at the time t when Q(t) = n 
because at that time X(f) = (a + acos 7 r,asin 7 r) = (0,0). Since 9(f) is defined by 
6 + sin# = kt and 6(t) = n we get n + sin7r = kt , or t- nIk. 


5.25 


Y"(t) = ab 2 X”(bt) = -ab 2 ^— 
l|X || 3 


= -ab 2 


a~ l Y 

lla^YII 3 


= -a 3 b 2 


Y 


That is - 


Y 


if a¥= 1 . 


Problems of Chapter 6 
Section 6.1 


6.1 

(a) 14 

(b) Yes 

6.3 

(a) 36, f = 9; 

(b) 64, f = 16; 

(c) 39, f =9.75; 

(d) 39 >64-36. 


6.5 The /z-cubes in DU E that intersect the common boundary have total volume 
less than some multiple sh , where s only depends on the smooth common surface. 
Since this tends to 0 with h the volumes add. 
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6.7 


(a) The plate measures 15 by 21. Using the upper and lower bounds, the mass is 


between 2(15)(21) = 630 and 7(15)(21) = 2205. 

(b) D\ measures 15 by 9, and D 2 15 by 12. So bounds give 

2(15)(9) = 270 < mass(Di) < 4(15)(9) = 540, 720 < mass(£> 2 ) < 1260. 

Additive property gives 990 < mass (D) < 1800. 

(c) 2(9)(12) + 2(3)(9) + 4(5)(12) + 6(10)(12) = 1230 < mass(D) 



c Jd 



(b)pmin(-05) < I pdV = 20 < p max (.05). Therefore p min < = 400 and 


Jd 


Pmax > ^ = 400. 

Section 6.2 
6.11 

(a) I IdA = Area(D). From single variable calculus we know this area is (use 
JD 

integration by parts) 



(b) I xdA, D the rectangle 0<v<3, -1 <y< 1. The graph of z = x is a plane 
Jd 


and z > 0 on D, so xdA is the volume under the graph of z = x over D. That 



region R is a wedge. 



(c) half the volume inside the unit ball, \ 

(d) same as part (c) 

6.13 (b), (c), and (d) only 

6.15 Let D be the unit disk centered at the origin. 
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By symmetry I y 3 dA = 0 and 
Jd 

3xy + 2)dA = 2n. 


I xydA = 0 and 

JD 


X 


dA = Area(Z)), so 



+ 


6.17 

(a) Suppose f(a , b) = p> 0. Since / is continuous (take e = \p) there is an r > 0 so 
that if 

(x-a) 2 + (y-b) 2 < r 2 

then 

\f{x,y)-f{aM<\p- 

Therefore f(x,y) > \p for all (x,y) in D. 

(b) Part (b) is the lower bound property. 

(c) Parts (a) and (b) give I /dA > 0 which contradicts I /dA = 0. So / cannot 


J any 


be positive at any point of R 2 . 

(d) Apply (c) to the function -/. By part (c), -/ cannot be positive at any point of 
R 2 . Therefore / cannot be negative at any point. 


6.19 The limits on the sum are i 2 + j 2 < 10 h~ 2 . When j = 0 this gives i 2 h 2 < 10 or 
- VlO < ih < VlO. Similarly when i - 0, - VT0 < jh < VlO. So it is the second 
integral. 


6.21 

(a) The sine function only takes values from -1 to l, so these are bounds on the 
integrand. 

(b) J = .423 + J^sin(jrj—^j^ 2 y)dA, where C consists of all points of the square 
that are within .001 of the right edge or top edge. Since Area(C) < .002 the last 
integral is between -.002 and . 002 , and we get 


.421 < J < .425. 


6.23 

(a) (i) g < f is given. 

(ii) Use Theorem 6.8 part (c): since 0 <f-g, 0 < f(f~g) dA 

(iii) Use Theorem 6.8 parts (a) and (b): J(f-g)dA = f fdA- f gdA 

(iv) Add f g dA to both sides of the inequality. 

(b) (i) Because CcD. 

(ii) Create a Riemann sum for I fdA using the same points as for the Riemann 

Jd 

sum for J r /dA, then any additional terms in the D sum are nonnegative 
because / > 0 . 

(iii) The sums for D tend to J /dA, and they are all larger than the C sum. 

Jd 
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(iv) By part (c), J' fdA is an upper bound for the C sums, so the limit J" fdA < 

f fdA. 

JD 


Section 6.3 


X I ( r* Vl— x 2 N r* 1 

,1 vd ’ d "iJ 


6.25 


\(l-x 2 )dx= |(2— |) = | 


6.27 


(a) In D, x < 0 < yj, so x < ^]y + x< yjy. Therefore 

f xdA < r(y^ + x)dA< I" yydA. 

Jd Jd Jd 


JD JD 

(b) In D, -1 < x <y - \ and 0 < y < 1 


j' D xdA = £[£ i dy = ^ 1 i((y-l) 2 -l)dy=i[i(y-l) 3 -y] ( J = -i + i=-i. 

= s/y daj dy = y/y(y -1 -(-1))dy = [fy 572 ]* = 

f (tf- 

Jd 


; + x)d A = | 


6.29 


I 


sin^dA =11 sinjdx 

JO [Jx=y 2 


dy 


-JV 


y 2 )sinjd}; 


= [(l-J 2 )(-cosy)]^-^ 2ycosydy = l-2|[ysiny]^-^ sinydy 

= 1 -2sin(l)-2(cos(l)- 1) = 3-2sin(l)-2cos(l). 

6.31 1,3,2,5,1: 


r r( 2) /~l /~z 

(3xy 2 + 5x 4 y 3 )dA = (3) xdx y 2 dy + (5) x 4 dx 

J[0,2]x[-l,(l)] Jo J-l Jo J-l 

Section 6.4 

6.33 

(a) Corresponding parts of the triangles are proportional, so 

kh 2 + he he 


Al) 


y 3 dy. 


£ + ha ha 


Solve to get i = 


h 2 ka 


U)|tO 
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(b) Similarly — = —--, giving m = hb + -kh 2 . 

hb he c 

(c) The change in area is 

A m(£ + ha)- Uhahb) = l(hb+ -kh 2 )( - +ha)~ Uhahb ) = ifc—/i 3 + U— h 4 . 

z z cc cc 

If h is small this is less than a multiple of h?. 


6.35 det 


2u -2v 
2v 2u 


= 4(u 2 + v 2 ), that is 4 at (1,0). The vertices of the triangle map to 


(1,0), (1.21,0), and (.99, .2), so the area is about four times as large. 


6.37 


r* nln r*b pi 

I e -||X|l dA= I I e _r rdrd^= I d 0 

*Jd JO Ja Jo Ja 


e r rdr. 


Using integration by parts, 


f 


e r rdr = 


(-l)re 


b rb 

+ I e -r dr = 
Ja 


= (-l-Z?)e^ + (l+a)e“ fl . 


So 



dA = 2 tt((—1 - b)e~ b + (1 + a)e~ a ). 



Fig. 9.21 Initials in Problem 6.39. 


6.39 See Figure 9.21. (a) stretches horizontally by a factor of 5, and 3 vertically 

(b) interchanges u , v then goes by (a) 

(c) right edge (u = 1 ) goes to the bottom (y = 0 ), and left edge to the top 

6.41 

(a) (5)(3)+ yy =9 15 + (15) 2 ii yes. 

(b) off by a minus sign 

(c) (5)(3) + f f =? 15 + 15 2 (i)(l - i) yes. 

Section 6.5 


6.43 D n = [0 ,n] x [0,1] is an increasing sequence of bounded rectangles whose 
union is D. 


(a) 


f e~ x dxdy= f 

JD n JO 


(~e~ n + l)dy —»1 so 


L 


c dxdy = 1 . 
















Answers to selected problems 


455 




so I e ^ curdy = 2. 

JD 

(c) In each subrectangle where ^ < y < | you have e _x;y > Q~ xk l n . Then 


so | e x ^dvdy = 2. 


'd 



'o Jo W 


e Xy dydx> f -e X ^dx = |(1 -e *)>^. 


Adding these, I e x>7 dA > ^(1 + \ + ^ + ••• + ^), which diverges. So | e ^dvdy 
JD n JD 

does not exist. 

6.45 The third and fourth only. 



6.49 First proof: If /(X) = p > 0, then by the definition of continuity (take e = |) 
there is a small disk centered at X where / > |, so / = /+ in that disk. Therefore /+ 
is continuous in that disk, and in particular at X. If /(X) < 0 there is a disk where 
/ < 0, so in that disk /+ = 0. Therefore /+ is continuous in that disk, and in particular 
at X. If /(X) = 0 let e > 0, then there is a small disk centered at X where |/| < e. 
Since /+( Y) is equal to either 0 or /( Y) at every point Y of that disk, we also have 
|/+(Y) -/+(X)| = |/+(Y)| < e there. Therefore /+ is continuous at X. 

Second proof: f+ = \(J + 1/|) is a sum of continuous functions, so it is continuous. 


6.51 

(a) The set is the part of the rectangle [0,1] x [0,2] below the line y = 2-x. 

(b) 




6.53 The probability that (x,y) is not in D is 1 minus the probability that (x,y) 
is in D. 

6.55 



converges as n —> oo. 
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(b) e r goes to zero much faster than-- so it is also integrable. 

1 + r 4 

(c) Since \x\ < yjx 2 +y 2 = r, this function is integrable by comparison with part (a) 
also. 

(d) 

p27r pn pn pn 

I I (ye~ r ) 2 rdrd6 = 2n I y 2 e _2/ Vdr < 2n I r 2 e _2r rdr 

Jo Jo Jo Jo 

J pn pn pn 

Q~ 2r r > dr<2n I e _r max(e _r r 3 )dr < (const.) I e _r dr 

o Jo Jo 

converges as n —» oo. 

Section 6.6 
6.57 

(a) a 5 by definition. 

(b) J" x\dx\(cfi) = ^a 7 by iterated integral. 

(c) ^a 7 - ^a 7 +la 3 x$dx$ f Q a xydxy = la 3 (^a 2 ) 2 = |a 7 by part (b) and iterated 
integral. 

6.59 

(a) 


1 


XZ 2 dV : 


pZ ^10 

J, iy L 


[ 1,2] x [3,5] x [—1,10] 


9 3 1000+1 

z 2 dz= -(2)---= 1001. 


(b) 


p px=z py=j pz -1 u px=z py=j 

xz 2 dV= I I xz 2 dzdydx = I I ^x(1000-(v+y) 3 )dydx 

Jd Jx= 1 J_y=3 Jz=x+y Jx= 1 Jy=3 

px=2 px=2 

= I ixflOOOj-iU+j) 4 ] 7 " dx= I ±x(2000-±((x + 5) 4 -(x + 3) 4 ))dx 
Jx=l ^ Jx-1 

p2 

= ij (2000x- |(4x 3 (5-3) + 6x 2 (5 2 -3 2 ) + 4x(5 3 -3 3 ) + 5 4 -3 4 ) dx » 821.6 


6.61 Using spherical coordinates 


f e“^ 2+ >’ 2 « 2 dV = 4^ f e- p p 2 dp = 4n\(-p 2 -2p-2)e- p ] R = 4n((-R 2 -2R-2)e- R + 2) 
J||X||<fl Jo I- Jo 

.63 

rn I 

(a) I e -2p dp = --(e“ 2w - 1) tends to i as ft tends to infinity. 

Jo 2 


(b) Integrate by parts. 
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(c) Let n tend to infinity in part (b). 

(d) Using part (c) and (a) we get i\ = ^/ 0 = Using part (c) again 4 - \i\ - 

Then 4 = So 4 = ^4 = 

6.65 



J r*27T p;r r>n r n/7 

I I e~ p p 2 sin<pdpdtpdd= lim 47 t -e _p (p 2 + 2p+2) 
o Jo J P L 


4^e p (jr + 2p+2). 


6.67 


(a) 

(b) 

(c) 

(d) 



Vol([0,2]") 



Axi ■ ■ ■ dx n 


I A dx 2 • • • dx n = A 
J[0,]j"-' 


Problems of Chapter 7 
Section 7.1 


7.1 Write X(f) = A + f(B - A), Y (u) = B + ±n(A - B). 
Then ||X'(0II = IIB-AII, ||Y»|| = £||A-B|| = |||X'C0II- 


(a> 

(b) 


f ds= f ||X'(0lld?= ||B-A||, and f ds = f ||Y»||dw = 2(±)||B-A||. 

Jc Jo Jc Jo 

f yds = f (a 2 + t(b 2 - a 2 ))l|X , (f)||dt = (a 2 + \{b 2 - a 2 ))||B - A|| 
Jc Jo 

= \{a 2 + £> 2 )I|B - A|| and fyds = £ (b 2 + \u(a 2 - b 2 ))||Y»||dn = (2b 2 


JC Jo 

+ ^(a 2 - b 2 ))\||A - B|| = i( fl2 + & 2 )||B - A||. 


4 V^Z U U)2 11 II — 2 V 

(c) Both parametrizations agree about the integral of y and about the length of C, 
so both give average = ^(<22 + Z 4 ). 

7.3 X(0 = X(ks) = Y (s) = A + ks (B - A). 


Length (C) 


= f IX' 

Jo 


(Oil dr = IIB-AH. 


and Y'(A) = &(B - A) is a unit vector if k = ||B - A|| 1 . Then since 0 < t < 1, s runs 
from 0 to 1 /k = ||B - A||. So Y(Length(C)) = B. 


7.5 Parametrize the segment C from A to B as X(t) = A + 1 (B - A), 0 < t < 1. Then 
the average of Xf is 





458 


Answers to selected problems 


J c Xids 
Length (C) 


X 


7.7 y 2 dx + xdy 


I|B-A||X <ai 

= / 

J(0 


+ t(bi-ai))\\B-A\\dt = 


ait+h 2 (bi-ai) 


— o (fti 


y 2 dx + xdy + 


X. 


(0,0)^(1,0) 


X 


y 2 dv + xdy 


( 1 , 0 )->( 1 , 1 ) 


y dx + xdy 

1(1, i)->(0,0) 

Parametrizations for the 3 segments are (x,y) = ( t , 0), (1,0* (1 - 1 ,1 - 0 with 0 < X < 1 
in each case. We get the integral 


: f ( 0 dX + 0 )+ f ( 

Jo Jo 


(0dr + 0) + J ((f z )(0) + ldt) + J^ ((1 - tf + (1 - t))(-dt) = 0+ l- j- 5 = g. 

7.9 G N = (/ 2 ,-/i)• (f 2 ,-fi) = M +/iti = F T. 

7.11 X'(f) = (l, f(t)) so ||X'(f)|| = ^/l + {f'(t)) 2 . For the segment y = 3x,0< x < 1, 

I Vl+3 2 df = Vl + 3 2 
Jo 

agrees with the length that is given by the Pythagorean theorem. 

7.13 

(a) A rotation by 7 r /2 changes x 2 to y 2 and takes C\ to itself without stretching, so 


l l _ l 


f x 2 ds= f 

Jc 1 Jc 1 


/ds. 


Therefore 


JCi 


(x 2 +y 2 )ds = f 

Jc 


Cl JC! 

x 2 ds. 


(b) Same reason. 

(c) Holds because (x 2 +y 2 ) 10 = 1. 


Ci 


7.15 

(a) 


Jp dx = J~(y,0)-T ds. The integrals have opposite signs for physical reasons 

and because F = (y, 0) is in the direction of T on K roughly, and opposite that of 
T on E, that is, F • T > 0 on K and F • T < 0 on E. 

(b) The coordinates in the figure are x,p instead of x,y, so the work done is the 

same as I y dx. Problem 7.14 shows that this is the area between the graphs. 


7.17 


(a) This is because for small A 6 the segments are small, so can be used in a good 
Riemann sum approximation. 


(b)U=^±=W±. 
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(c) The factors T As are approximations for arc length segments, and the dot 
products give the projection in the W direction. Dividing by r gives the change 
in 6 by definition of radian measure. 

(d) This combines parts (a) and (c). 


7.19 


(a) 

(b) 

(c) 



xdx = ^3 2 , yes equal 

xd(^x) = f ^3 2 , not equal to (3)(4). 

dv + 5d(|v) = (1 + 5(|))(3), yes, equal to 3 + (5)(4). 


Section 7.2 


7.21 

(a) An antiderivative for g x = —-- is i log(x 2 +y 2 ) + h(y). But checking g y we 

x 2 +y 2 

see that we can take h- 0. So a potential is g(v,y) = \ log(v 2 +y 2 ), defined on 


0. 


„-i^ 


(b) Due to algebraic simplification the rules for V tan (-) and F(v,y) are the same 
where their domains agree. But the domain of tan -1 (-) is R 2 minus the y axis, 
l 2 -0. So 

-y x \ 


not 1 


Vtan -1 (^) t ( 


x 1 +y 2 ’ x 2 +y 2> 


7.23 


(a) Not. 

(b) [xy - 3 z 2 cosyJlo o!o> = ab ~ ^ cos b ’ 

(c) [x + y]™% = a + b 

7.25 

(a) The integral is equal to J VOIXIf 1 ) -Tds = [||X|r']^ = ± - 

(b) The integral has not been defined since the integrand is not defined at (0,0,0). 
7.27 The Chain Rule together with ||X||“ 3 X = V(-||X|r') gives 


nx-pir 3 (x-p) = v(-iix-pir 1 ) 


for any constant P. Then 

ciix-pir 3 (x-p) = v(-ciix-pir 1 ) 


for any constant c. Take k different such expressions and add them to get 


k k 

2 c,nx - p ; ir 3 (x - p,o = v( - 2 C; iix - p jir 1 ). 


7=1 


7=1 
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7.29 

(a) By the Chain Rule, -F(X) = VdlXH" 1 ) and F(* + h,y,z) = V(-\\(x + h,y,zW l ), 
so F(x + /t,y,z)-F(X) = V(-||(x + /t,y,z)ir 1 + ||X|r 1 ). 

(b) The limit is J^F which is conservative because of the interchange of mixed 
partial derivatives: J^Vg = Vj^g. 

(c) is conservative because of the interchange of mixed partial derivatives: 
&8 = Vfa- Similarly for |G and |G. 

Section 7.3 


7.31 Let f(x,y ) = x 2 +y 2 and g(x,y) = x 2 -y 2 . Then V/(v,y) = (2x,2y) and Vg(x,y) = 
(2x, -2y). The areas of the graphs are 


U 


l+f? + fydA = J ■ s jl+4x 2 + 4y 2 dA 


and 


J^ a J l+ 8x + gydA = yfl+4x^+4y^dA. 

These are equal. 

7.33 S is the graph over D of z = d-ax-by ). So z y = ~~ c and 


Area (S'): 


(4 + 4 + ldA 

<. C A C A 


: a/ 1 4^ + 1cL4 = h Area(D )- 


7.35 

(a) X m (m, v)=( V 2 v, 2 u, 0 ), X v (m, v)=( V2 u, 0 ,2v), X u x X v =(4 uv, -2 V2v 2 , -2 V2 u 2 ) 

(b) X M and X v are bounded because 1 < w < 2, 1 < v < 2, and are linearly 
independent by inspection since u and v are not zero. X is one to one because if 

( V 2 UV,U 2 ,V 2 ) = ( V2M1V1, u 2 v v\) 

with u, u\ , v, v\ positive then 


u\ - ^u\ - yu 2 = u, 


vi 




So the range S of X is a smooth surface. At any point of S , (v,y,z)=( V2 uv, u 2 ,v 2 ), 
you have x 2 - 2yz = 2u 2 v 2 - 2u 2 v 2 = 0 . 

(c) The area of S is 

f d(r = fifi l|X “ X Xvlldwdv = £ ^ ^ 4w ) 2 + (-2 ^v 2 ) 2 + (-2 V2w 2 ) 2 dwdv 


-n>‘ 


-rj> 


?v 2 + 8 v 4 + 8 w 4 dw dv 


i / 2 + v 2 )dwdv 
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= V8 J' [|w 3 +v 2 w] _^dv = V8 J' (| + v 2 )dv = V8(| + |)= V8y. 


(d) 


w 2 + v 2 )dwdv 


f yda = f f m 2 ||X„ xXJdMdv = f f u 2 ^ 8 (, 

Js J 1 Ji J 1 J 1 

= V 8 J 2 [i M 5 + ivV]' =i dv= Vs £(f + Zv 2 )dv= Vs(^ + g). 

7.37 Define G(x,y,z) = (-x,-y,-z). Using any parametrization Xi(m,v) = ( x\(u,v ), 
yi(M,v),zi(w,v)) we get 

J" x 2 ydcr= J ~ J" x^(w,v)yi(w,v)|| • • • ||dwdv 

and using X 2 (w, v) = G o Xi = (- xi (u, v), -yi (w, v), -zi (u, v)) we get 

J" x 2 ydcr = J" x 2 (u,v)(-y\(u,v))\\ • • • ||dwdv. 

Since these are equal the integral is zero. 


7.39 

(a) x 2 + x 2 +x 2 = l. Square this to get 

1 = (x 2 + x\ + X 2 ) 2 = X 4 + x\ + X 4 + 2(XjX^ + X 2 X 3 + X^Xj). 


(b) X 3 = z = cos 0 , so 


/ r»27T r*7T 

x 4 dcr = J J (cos 0) 4 sin 0 d 0 d^ = ^(-cos 5 ( 7 r) + cos 5 (0))27r = | 7 r 

(c) Integral of expression (a) using symmetry and Area(S) = 47 T gives 


3( 


^n) + 2(3) J" x 2 x 2 dcr = 47 t, 


SO r XjX^dcr = ^(4- y)7T = j^7T. 

Js 2 

7.41 (a) F • N Area (S') = (2,3,4) • (0,-1,0)(1) = -3 

(b) -3 + (2, 3,4) • (0,0, -1 )(2) = -3 - 8 = -11 

(c) +ll 

(d) -11 + 11 =0 

7.43 Use parametrization X(w,v) = uV + vW = (u,u,u + 2v), 0 < u < 1, 0 < v < 1. 
Then X M = V,X v = W,X M xX v = VxW = (2,-2,0), and the flux is 
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/ 


F • (X u xX v )dudv 


f f'(2 (h)(2) 

Jo Jo 


+ 3(a + 2v)(-2))dadv 



(-2a- 12v)dadv = -1 - 6 = 


-7. 


7.45 

(a) Multiplying we get 


[A BY] 


a b 0 



~yi 

c d 0 

= 

aA + cB Z?A + (iB 

y2 

0 0 1 



A3. 


This is equal to [C D Y] by assumption. 

(b) Take the determinant of each side and use the multiplication rule for 
determinants. Use the definition of cross product det [PQR] =R (PxQ). 

(c) Use the fact that if Y • X = Y • Z for all Y, then X = Z. This can be seen by taking 
Y = (1,0,0), (0,1,0) and (0,0,1). 

7.47 

(a) f (p\)\ • Ndcr = f (p(a,b,c))(a,b,c) • ( 0 , 0 ,l)dcr = p(a,b,c)c f 

S S s 

d cr - pcA(a,b,c). 

(b) f (pV)V-Ndcr = f (p/fcN)£N-Ndcr = pk 2 N f dcr = pic 2 AN. 

Js Js Js 

7.49 Denote by a the radius of S , then N = X/a, and on S , ||X|| = a so 

f F • N dtr = f* X dtr = f ^—d cr = a~ 2 (4na 2 ) = 4 n. 

Js a 5 a Js a 4 

7.51 Y(a,v) is a constant times the parametrization X(a,v) so Y is differentiable on 
the same set D , is one to one, and Y u = kX u and Y v = kX v are linearly independent. 
Therefore Y is a parametrization and T is a smooth surface. Also 

Y u (u, v) x Y v (a, v) = (kX u ) x ( kX v (u , v)) = k 2 X u x X v (m, v) 

so 

Area(T) = f ||Y M x Y y ||dadv = f k 2 \\X u xX v ||dadv = k 2 Area(5). 

Jd Jd 


Problems of Chapter 8 
Section 8.1 

8.1 One way is shown in Figure 9.22. D\ = R-U is a union of 6 parts that are x 
simple and y simple. - S - R is a union of 4. 
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Fig. 9.22 Figure for Problem 8.1. 


8.3 The divergence of the field is 2, so flux across C is 

(a) 2 Area (disk) = 27r, 

(b) 2 Area (rectangle) = 2 (b-a)(d-c). 

8.5 divF = 2 xy 2 

J divFdA = £ £^£_2xy 2 dydx = £ y (1 -x 2 ) 3/2 dx = [f(l-* 2 ) 5/2 5 T-]' = 

Parametrize the vertical part of the boundary as (v,y) = (0,1 - 1), 0 < t < 2 and the 
curved part as 


Then 


(x,y) = (cos f, sin f), < t < |. 



F-Nds 



r>n/2 

I ( -dt)+ I (1 +cos 2 rsin 2 0cos^dr 
JO J-tt/2 


(*/2 

= - 2 + I (1 + (1 -sin 2 0sin 2 0cosrdr = -2 + 

J-n/2 


1 • 3 

sinl+^ sin t- 


^ sin 5 1 


nj2 

-n!2 


= -2 + 2(1 + i-i) = 


4_ 

15* 


8.7 


(a) f (1,0)-Nd^ = 

f div(l,0)dA 

JdD 

Jd 

r* 

(b) (0,1)-Nds = 

OdA = 0 

JdD 

Jd 


f 


OdA = 0 
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(c) So J Nds = ( | n\ds, f n 2 ds) = 0 

JdD V JdD JdD 

(d) N i is constant on D t so I N/ ds = N ; - Length (Df) = ||P Z - - P*_i ||N/ 

JDi 

(e) (; D Ndv = 0 = Z f Di N,-d.v = Z ||P, - P,_, ||N ( . 

8.9 If the loop does not surround (0,0) then the integral is zero by Green’s Theorem 
since curlF = 0 throughout the region enclosed. If the loop does surround (0,0) then 
the integral is 2 n by Example 8.9. 

8.11 

(a) Adding the product rule (gfj) Xj = g Xj fj + gfj, Xj for j= 1,2,3 you get 

div(gF) = gdivF + F • Vg. 

(b) Apply the Divergence Theorem to gF. The integral over the boundary is zero 
since g = 0 there, so the result follows from part (a). 

(c) This follows from part (b) because when F = Vg, divF = div Vg = Ag. 

8.13 Suppose there were a periodic orbit. Denote by D the region with the orbit as 
its boundary. Then 

0 < f divFdA = f F-Nds. 

JD JdD 

Use the periodic solution to parametrize the boundary. That could be either 
clockwise or counterclockwise, but in either case Green’s Theorem gives 

0 A [ P f(x(t)MO)dy-g(x(t),y(t))dx = Cx'(t)y'(t)dt-y'(t)x'(t)dt = 0, 

Jo Jo 

a contradiction. 


Section 8.2 


8.15 Since divX = 3, the integrals are 3 times the volume of D. (a) An, (b) 4 nr 3 . 


8.17 

(a) 

(b) 0 

(c) f 

J bs 

(d) 

(e) 0 




(l + 3 + 5)dV = 9 (|tt8 2 ) 

(l)dV = f?r8 2 

(lx) dV = 0 by symmetry. 
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8.19 

(a) This is a property of the dot and cross product, 

(N x F) • C = det [N F C] = N • (F x C). 

(b) Use the Divergence Theorem. 

(c) Use 

div (F x C) = div (/ 2 c 3 - / 3 c 2) foc\ - f\c 2 ,fi c 2 - f 2 c\) 

= (/ 2,1 -/l,2)C3+(-/3,l +/l,3) c 2 + (/3,2-/2,3) c l = (curlF)-C 
and part (a). 

(d) Use parts (a) and (c). 

(e) Factor out the constant C. 

(f) Since C was any constant vector that means the vector in parentheses in part (e) 
must be the zero vector. 


8.21 

(a) By the Divergence Theorem 


., j (1,0,0)-Ndcr = j OdU, or j nidcr = 0. 
JdD JD JdD 


(b) Using (0,1,0) and (0,0,1) similarly each component of 


f 1 


dD 

Ndcr is zero. 


(c) A face S of D is a flat surface with a constant normal vector. So 


X 


Ndcr = N 


X 


dcr = Area (S')N. 


8.23 

(a) all points of R 3 except the 

(b) divF = 0 so by the Chain Rule divG(X) = cidivF(X - A x ) + • • • = ci(0) + • • ■ = 0 

(c) J F(X) • N dcr = 47r if 0 is in the interior of D and 0 is 0 is in the exterior, so 
Jd 

f G-Nd o-=Yc k f F(X-A*)-Ndcr= V 4 nc k . 

JdW k JdW _, AjfinW 

0 or 4n 


8.25 For C 2 functions 


div(vVtz) = ( vu x ) x + (vuy) y + (yu z ) z = v x u x + v y u y + v z u z + vu xx + vu yy + vu zz = Vv-Vu + vAu 
so the Divergence Theorem in a regular set D 


j di\(vVu)dV = j i 
Jd JdD 


vVu - Ndcr 
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f (VvVm + v^m) dV= f \ 
Jd K ; JdD 
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vVu -Ndcr. 


Taking v-u gives part (a). Since Vu-Vv = Vv -Vu, we can interchange u and v and 
subtract to get part (b), 


X 


(vAu-uAv) dV = I (vVm- t/Vv) -Nd<x. 

D JdZ) 


8.27 The identity 


gives 


f(/zl/+|V/| 2 )dV = f fVf-Ndcr 

JD JdD 


f (i/ 2 + |V/| 2 )dV = 0. 

JD 

If T > 0 then that integrand is nonnegative, so it must be zero in D. Therefore / is 
zero in D. 


Section 8.3 
8.29 

(a) f F-Tds = f curlF-Nd<x = f (0,0,1 

JC Jdisk Jdisk 

(b) Since C is oriented from A to B to C to A we need N to point away from 0. 


, 1) • (0,0, l)dcr = 7r 


f FTds= f 

JC J tria 


-/ 

Jim 


curl F • N dcr w w 

JC J triangular surface triangular surface ll(B-A)x(C-A)|| 


((».,». i, ,!!! A)x(( l A) „ d, r 


-j. 


(0.0.-1)- {bC -“- ab) dcr 

triangular surface ll(B — A) x (C — A)|| 


Area (triangle) = - —— A ^ rr , —^ J, ||(B - A) x (C - A )|| = —\ab 


ll(B — A) x (C — A)|| 


ll(B-A)x(C-A)|| 


8.31 


(a) curlF = (0,0,2). On the hemisphere z = g(x,y) = r 2 - x 2 - y 2 and N = (x,y,z)/r. 


I I x 2 y 2 v 

dcr= yjl+g 2 x + gjdxdy= + - +-dxdy = ^—^dxdy. 


g(x,y ) 


So 


curlF-Ndtr = | (0,0,2). <lcr = f 2 S -^> - 0,0, 


f curlF-Ndcr = f (0,0,: 

Js Js 


d r g(x,y) 


where D is the disk of radius r. This is equal to 2 Area (D) = 2nr 2 . Since z = 0 
on dS , the line integral is, using Green’s Theorem and dS = dD, 
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| F-Tds= J -ydx + xdy + 2dz = J -ydx + xdy = J 

Jas JdS JdS Jd 


2 dA = 2n r l 


JdS JdS JD 

(b) The line integral is the same as in part (a) and the surface integral is 


X 


(0,0,2) • (0,0, l)ds = 2 Area (D) = 2{nr z ). 


8.33 Write the sphere as the union of hemispheres H and K. Then H and K have 
the same but oppositely oriented boundary circle, so by Stokes’ Theorem applied to 
each of H and K , 


[ curlF-Ndcr= f curlF-Nd<x + [ curlF-Ndcr= [ F-Td s+ f F- 

Js Jh Jk JdH JdK 


Tds = 0. 


8.35 


(a) The Maxwell equation gives 0 = (0,0,2)ci -yUoJ, so c\ - \noj. 

(b) We know that curl () = 0 and J = 0 so p = 2. 

(c) We need for continuity on r = R that \noj = jp, so C 2 = ^o7^ 2 - 

(d) The surface integral side of Stokes’ formula is the sum over the continuous parts 


| curlB-Ndcr= j yUoJ-Ndcr+ i 

JD Jr<R Jr± 


yUoJ-Nd cr+ I 0-Ndcr = jiojnR 2 +0. 

r<R Jr>R 


The line integral side of Stokes’ formula is, since T 


_ (-y,*, 0 ) 

“ Ri 9 


f B Td s= f c 2 — -Tds = Ij/iojR 2 C ± 

JdD JdD r z JdD K l 


1 


2nR\ 


= 2 W " R, 


=HojnR 1 


di 1 


These are equal, so Stokes’ formula holds in this discontinuous case. 

8.37 

(a) Stokes’ Theorem applied to Ampere’s original law gives 


f curlB-Ndcr = yuo [ J-Ndcr 
Js Js 


for all S , so curlB = yttoJ. 

(b) p t = -div J = -(yuo) _1 divcurlB = 0 does not allow any time dependence of the 
charge density, 
d 


8.39 


dt 

X ( - cu 


r b • Nd<x= r 

Js Js 


B Nd<x= B t • Ndcr. By Maxwell’s equation this is equal to 


curlE) • Ndcr. By Stokes’ Theorem this is equal to 


-/> 


E-Tds. 
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Section 8.4 

8.41 The left hand side is zero because the time derivative is zero. The integral 
over the boundary of the cylinder becomes -p(b)u(b)A( 1,0,0) at the right end plus 
-p(a)u(a)A(- 1,0,0) at the left, so the conservation law is 

0 = ( -p(b)u(b) + p(a)u(a)}(A, 0,0). 


8.43 


V • V(V • V) = u(u 2 + V 2 + w 2 ) x + v(u 2 + v 2 + w 2 )y + w(u 2 + v 2 + w\ 

= 2 u(uu x + vv x + ww x ) + 2 v(uu y + vv y + ww y ) + 2 w(uu z + vv z + ww z ) 

= 2 u(uu x + vu y + wu z ) + 2 v(uv x + vv^ + wv z ) + 2 w(uw x + vny + ww z ) = 2 V • (V • VV). 

8.45 (c) The acceleration is to the left in the left figure, and to the right in the others. 

8.47 

(a) V(X,0) = X, V(X, 1) = ^X, both radial outward, slower after one second. 

(b) divV = y^di v(x,y,z) = is not zero, so the flow is compressible. 

(c) p t +pdivV + V • Vp = a(l + t) a ~ l ||X || 2 + (1 + 01X|| 2 ^ + • ((1 + t) a 2X) 

= (1 + i) a ~ x ||X|| 2 (a + 3 + 2). So take a = -5. 

849 ck , ck 

e t + div (eV) + Pdiv V = — yp y p, H-div (p r V) + kp y div V 

R R 

= %yp y ~ x pt + ~div (p r_1 (pV)) + kp y div V 
K K 

= —7P y ~ l Pt + — (p r_ 1 div(pV) +pY • V^- 1 )) + kp r divY 
R R \ ) 

ck ck / \ 

= — yp y ~ l p t -\ -(p r_ 1 div(pV) + (y— 1 )p r_1 V • Vp] + ^p r divV 

R R \ ) 

Using the mass equation p t = -div(pV) that becomes 

= — (-y + 1 )p r_ 1 div (pV) + — (y - 1 )p y ~ 1 V • Vp + kp y div V = — (-y + 1 )p r div V + kp 7 div V. 

R R R 

This is zero if y = 1 + f. 

8.51 p t + (pu) x = eg t + (pop/ + e 2 /g) x = e(gt +Po/x) if we ignore 6 2 . 


u t + ww x + kyp Y 2 p x = eft + (ef)(ef x ) + ky(po + eg ) 7 2 (eg x ) = e(f + kyp 7 2 g x ) 

if we ignore £ 2 , because the Mean Value Theorem gives («a + eg) b =a b + b(a + 6g) b ~ l e 
for some 6 between 0 and e. Then 
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gtt = -pofxt =Po(kypl 2 )g xx - 


Section 8.5 

8.53 Using 


d_ 

dr 


r 


we get Vg = (- f(r, t ), f(s, t), J f, dx). 
Vg-(a',b',\). 

X 1 


fix) dx = -f{r) 

Then the Chain Rule gives the result as 


8.55 pt+ppx = ~ 


(t + 8) 2 t + 8 t + 8 


= 0 . 


P 


x 

-1 0 

Fig. 9.23 The graph of p(x, 0) for Problem 8.57. 



8.57 

(a) The three parts of the formula for p are continuous in their respective sets, and 
agree on the common boundaries of those sets, so p is continuous. 

(b) The graph of p(x , 0) is shown in Figure 9.23. Your graph of p(x , 1) ought to show 
the constant 1 for v < 0, the constant 0 for x > 0, and a point of discontinuity at 
x = 0. 

(c) In the regions where p is constant, the partial derivatives are zero, so p t +pp x = 0. 

x 

In the region where p =-, we have 


Pt+PPx = 


(t-iy t-it-i 


= o. 


(d) In 


p(X{) + (-Xo)t, t) = -X() 


set x = xq + (- 


p{x,t) = - 


X 


8.59 The Fundamental Theorem gives 



(Pt + fx-g) dx = 0 


and it holds for all [ a,b ], with continuous integrand, only if that integrand is zero. 
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8.61 si = j(l +0) = j, s 2 = j(§ + D = f, * = 5(3 + |) = |, J4 = 5(0+ |) = 

* = 5(0 + 3)=!, 

8.63 x-21 and v = 3 +1.5 1 intersect, have the same x, t values, when x = It = 3 +1 .5t. 
That gives t = 6, then v = 12. 

8.65 The curves satisfying x' - x are (v,0 = {ct f ,t), one for each number c. These 
rise from (c, 0) on the horizontal v axis and separate as t increases vertically, p is 
constant on each such curve when 

^p(*(0, t) = p x x' +p t =p t + xp x = 0. 

If p(x, 0) = x 2 we get p(ce*,0 = p(c,0) = c 2 = ((ce ? )e _? ) 2 . Therefore 
p(v,0 = (ve _ 0 2 = x 2 e~ 2t . 


Problems of Chapter 9 


Section 9.1 


9.1 Since u xx > 0 the wave equation implies that also u tt > 0, so the acceleration is 
upward. This agrees exactly with our derivation of the equation based on tension 
forces, because these forces on the two ends of any bit of the string have an upward 
resultant when the shape is convex. 


9.3 u n = - A \( c i sin (^) cos(A 4 0 ) - E 2 (c 2 sin (^§5^) cos(E 6 £)) and 
Uxx = 1 sin ( 335) cos (^ 4 f)) - ( ^0^) 2 ( c 2 sin ( ) cos(£ 6 0 )- Then 

Utt = C 2 u xx gives 


c 


2 





E 2 

Divide these to get —f = 3 2 , so = 3 A 4 . 
A 2 


9.5 

(a) speed 3 to the left 

(b) speed 3 to the left 

(c) speed 1/4 to the right 

9.7 By the Chain Rule, u t = -cf\x - ct) + cg'(x + ct), u x = f\x - ct) + g'(x + ct ), 
u tt = c 2 f"(x - ct) + c 2 g'\x + ct), u xx = f\x - ct) + g"(x + cf). 

Therefore u tt = c 2 w xx . 


9.9 u t = -c sin(v + ct), u x = - sin(v + ct), so u 2 + c 2 u 2 = 2c 1 sin 2 (v + ct). 
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(a) £(0) = c 2 I sin 2 xdx 

Jo 

X 7T—C 

sin 2 (x + c) dx. E( 0) is larger because 

r*7i—2c r*7T 

E(l) = c 2 I sin 2 0d0<c 2 sin 2 0d<9 = E(O). 

Jo Jo 

9.11 

(a) u(x,0 ) = sinx = f(x) + g(x) and u t (x,0 ) = 0 = -cf'(x) + cg'(x). Then /' = g' so 
f = g + C, then sinx = 2g(x) + C. So 

g(x) = \ (sin x - C), /(x) = \ (sin x + C), w(x, t) = \ sin(x - ct) + \ sin(x + ct). 

(b) Here c — 1. u(x , 0) = 0 = /(x) + g(x) and u t (x, 0) = cos(2x) = -/'(x) + g' (x). Then 
/ = —g so cos(2x) = 2g'(x). Thus g(x) = | sin(2x) + C. So 

g(x) = | sin(2x) + C, /(x) = - J sin(2x) - C, w(x, 0 = -1 sin (2(x - t)) + J sin (2(x + t)). 

(c) Similarly 

u(x , 0 = | ( sin(x - 50 + sin(x + 50) + ^ ( sin (3(x - 50) + sin (3(x + 50)) 

+ ^ ( - sin (2(x - 50) + sin (2(x + 50)). 

9.13 u(x, 0 = -g(-x + ct)+g(x+ct) and its derivatives are nonzero only when -x+ct 
or x + ct is in the short interval where g A 0. As t tends to infinity, with x confined to 
[0 ,p], neither of those numbers is in that interval. Therefore the integral is zero. 

Section 9.2 

9.15 

(a) u tt + = c 2 (u xx + + u yy + v yy )\ add the wave equations for u and v 

(b) Let u = kz. Then u tt = kztt = kc 2 Az = c 2 A(kz) = c 2 Au. 

(c) Set w(x,y,0 = z(-y,x,t). Then 


w tt (x,y, t ) = z tt (-y, x, t) w x (x,y, t ) = z y (-y.x.t ) w xx (x,y, t) = z yy (-y.x.t ) 

w y (x,y,t) = -Zx(-y-x.t ) Wyy(x,y,t) = (-l) 2 z xx (-y.x.O 

SO 0 = C 2 (Zjcx + ZyyX-y, X, 0 = C 2 (w y> , + W**) 

(d) Set vv(x, v, t) = z(kx,ky,kt), then w u (x,y,t) = k 2 ztt(kx,ky,kt), w xx = k 2 z xx , 

Wyy = k 2 Zyy, SO - C 2 (W XX + Wyy ) = ^ (z„ ~ ^ (Z XX + %)) = 0. 

2n 

9.17 sin(1.414c(f + 1000—)) = sin(1.414cf + 1414(2 tt)) = sin(1.414c0, and 
c 

similarly for the 2.236 term. The 1000 insures that we’ve added a whole multiple of 
2n. 
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9.19 

(a) ztt = ~in 2 + m 2 )z, Zxx = - n 2 z , Zyy = -m 2 z, so z tt = z xx + Zyy. Since n and m are 
positive integers, from sin(^7r) = sin(ra7r) = sin(O) = 0 you have the boundary 
value 0. 

(b) There is one such solution for each integer pair (j n,m ) inside the first quadrant 
of the disk of radius 1000. Since each (n,m) sits at the comer of a 1 x 1 square, 
the number of them is roughly equal to the area of the quarter disk. 

9.21 = f'r x sm(kt), and r x - \{x 2 +y 2 ) -1/2 (2x) = x/r. 

z xx =(r^+f(r) r -^^)sin(kt). Similarly - vv = Adding, 

Zxx+Zyy = ( f"ir) + -f(r))sin(kt). 
r 

Since ztt = ~k 2 f(r) sin (kt) we see that /" + + k 2 f = 0 implies ztt = z** + Zyy- 

9.23 

(a) u p = -p~ 2 f(p ± t) +p~ l f'(p + 0 and 

u pp = 2p~ 3 f(p ±t)~ 2 p~ 2 f(p + 0 +p~ l f"(p + 0- So 

Upp + jup = p~ l f'(p + t). 

But Wft = p -1 /", so w is a solution of the wave equation. 

(b) u(p, 0) = p _1 /(p) has maximum of 1 near p= 100. Therefore the maximum of / 
is approximately 100 near p = 100. Then 

u(p,99) = p~ l f(p + 99) 

has a maximum of approximately 100 near p = 1. That is, the maximum of the 
wave was 1 on a spherical shell of radius 100 and becomes after 99 seconds a 
maximum of 100 on a shell of radius 1. 

Section 9.3 
9.25 

(a) (e~ n2t sin (nx)) t - (e - ^ sin (nx)) xx - (-n 2 + n 2 )eT n2t sin(nx) = 0 

(b) (tPe~ x2/(4t) ) t = + and (t p e~ x2/(At) ) x = -f t t p 

{t p e~^/ (Ai) ) xx = ~Y t t v Q- xl ^ At) +(f t ) 2 t p s~ xl ^ 4t \ Therefore p = 

(c) (e _ax cos (ax - bt)) t = boT ax sin (ax - bt ), 

( Q~ ax cos (ax - bt)) x = -ae~ ax cos (ax - bt) - at~ ax sin (ax - bt), 

ioT ax cos(ax - bt)) xx = a 2 e~ ax cos (ax - bt) + 2 a 2 Q~ ax sin(«x - bt) - a 2 e~ ax cos (ax - bt). 
Therefore b = 2 a 2 

(d) With T(x,t) = u(kx,k 2 t) you get T t = k 2 u t (kx,k 2 t), T xx = k 2 u xx (kx,k 2 t), 
so T t - T xx = k 2 {u, - u xx ) = 0. 
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9.27 

(a) 1, -1, -4: T(x,t) = e _? sinv + ^e _zk sin(2x) 

(b) Your sketch ought to indicate the progression described in part (c). 

(c) The second term decreases much faster because of the e -4r factor. So the first 
term is more important as t increases, and the hot spot moves toward the center, 
to the right. 

9.29 y\(t) = ^ Jq e - *sin(x)dv = |e _/ satisfies y' = ~\c~ l - -(yi -0), so Newton’s 
Law of Cooling holds for y \. 

yi(t) = ^ f 0 (e _r sin(x) + c~ 9t sin(3v)) dx = |e _/ + ^e _9r . We observe that Newton’s 
Law of Cooling 




does not hold for y 2 for any choice of k. 


9.31 

(a) Take the t derivative under the integral sign. 

(b) Use the Divergence Theorem in R 2 . 

(c) Properties and definition of Laplacian 

(d) Since D is arbitrary, the integrand must be zero. 

9.33 (x 2 + y 2 + 4 ht) xx = 2 = (v 2 + y 2 + 4 ht) yy , and (v 2 + y 2 + 4 ht) t = Ah. So T is a 
solution. The heat flux is -rWT = -r(2x,2y) so is toward the origin, that happens to 
be the coldest place. 

Section 9.4 


9.35 

(a) A(u\ + U 2 ) — u\ xx + U 2 XX + u\yy + U 2 yy — Au\ + Au 2 — 0 + 0 = 0. 

(b) Write v(v,y) = u(xcos6-ysin6,xsin6 + ycos6). Then 

v x = u x cos 6 + Uy sin 6 , = -u x sin 6+ u y cos 6 , 

Vxx = ( u xx cos 6 + u xy sin 6) cos 6 + ( u yx cos 6 + u yy sin 6) sin 6 , 

Vyy = -(-u xx sin6 + cos 6) sin6 + (~u yx sin6 +Uyy cos 6) cos 6 

So v xx + v yy = u xx (cos 2 6 + sin 2 6) + Wxy(0) + u yy ( cos 2 6 + sin 2 6) = u xx + u yy = 0. 

(c) A(uv) = (Au)v + 2Vu • Vv + u(Av) = 0 + 0 + 0 = 0. 

9.37 z(x,y) = logO 2 +y 2 ), so 

2x 2 y 2 2x(2x) 2 2y(2y) 

Zx = ^7f’ Zy = ^7f’ Zxx = ^7f~(^+y¥’ % = ^77 _ (* 2 +y 2 ) 2 ’ 

4 2(2x 2 + 2y 2 ) _ 

v 2 +y 2 (v 2 +y 2 ) 2 


so 
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9.39 

(a) div (wVw) = wAw + Vw • Vw, so by the Divergence Theorem 

0= I wVw-Nd<x = I (wAw + Vw-Vw)dV 
JdD Jd 

(b) Use part (a) with w-u. That gives 0 = - ( ||Vw|| 2 dV. So \\Vu\\ = 0. That makes 

Jd 

u constant in D , but since u is zero on the boundary it must be zero everywhere. 

(c) u-v solves the Laplace equation and since it is zero on the boundary it is zero 
everywhere by part (b). 

9.41 (a) 1(2 + 2) = 1 (b) i(2) + ±(2) = 1 (c) k(z xx + z yy ) + ( 1 - k)(w xx + w yy ) = 

k + (1 — k) = 1 (d) Zxx 4" W xx + Zyy + Wyy = Zxx + Zyy + W xx + Wyy = 1+0=1 


9.43 

(a) Zxx = - n 2 z , Zy_y = n 2 z , so Az = 0. 

(b) z(0,y) = sin(0) sinh(^y) = 0, z(jt,y) = sin(7r) sinh(^y) = 0, z(x, 0) = sin(^v) sinh(0) = 
0. 

(c) z(^,y) = sinh(Ay) = \{t ny -e~ ny ) is nearly \t ny that is very large when y is large 
positive. 


9.45 


(a) F = Vm = v(. 


X + 


where r 2 = x 2 +y 2 . So 


x 2 +y 2 


) = L 


j 2 -x 2 


-2 xy 

(x 2 + y 2 ) 2 ’ (x 2 + y 2 ) 2 


) 


(1,0) + r~ 4 (y 2 - x 2 , -2xy) 


z)i< = divF = 0 + r 4 div(>' 2 -x 2 ,-2xy)-4r 5 V(r)-(>' 2 -x 2 ,-2xy) 


= r~ 4 (-2x - 2x) -4r~ 5 -(y 2 -x 2 , -2 xy) = r _4 (-4x) - 4r _6 (-xy 2 - x 3 ) 

r 

= 4xr~ 4 ( - 1 + r~ 2 (y 2 + x 2 )) = 0 

(b) (*,y) • F (x,y) = (x,y) ■ ((1,0) + r~ 4 (y 2 - * 2 , -2xy)) 

= x + r 4 (-xy 2 - x 3 ) = v + r 4 (-vr 2 ). This is zero when r = 1. 

(c) Since \x\ < r and |y| < r, ||(y 2 - x 2 , -2xy)\\ is less than a multiple of r 2 . 
Therefore ||F - (1,0)|| = || r~ 4 (y 2 -x 2 , -2xy)\\ tends to zero as r tends to infinity. 

Section 9.5 


9.47 

(a) if/ t = -i Ee~ lEt (f) andzl^ = e~ lEt A(p. Therefore 

iif/t = Ee~ iEt (f) = Q~ iEt (-A(f) + V(p) = -A\\f + Ve~ iEt (p = -A\\f + Vi//. 

(b) |^(X,0l 2 = if/ifs = e lEt (p(X)e~ lEt (p(X ) = |0(X)| 2 does not depend on t. Therefore 
the probability P(S, t) that is an integral of that, does not depend on t. 
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9.49 

(a) <t) x = X' YZ , -c/> xx = -X"YZ = (E x -x 2 )XYZ etc., so 

-A</> + (x 2 +y 2 +z 2 )<l> = ~(X" YZ + XY"Z + XYZ") + (x 2 +y 2 + z 2 )XYZ = (E x +E y + E z )<f>. 

Therefore E = E x + E y + E z . 

(b) W(w) = (a + Z?w + cw 2 )e _w2/2 gives 

W' = (b + 2cw - w(a + bw + cw 2 ))e~ wl/2 = (b + (2c - a)w - bw 2 - cw 3 )e -w2/2 

SO 

-W"+w 2 W 

= | - ((2c - a) - 2 bw - 3 cw 2 - w(b + (2c - a)w - bw 2 - cw 3 )) + w 2 (a + bw + cw 2 ) je _w2/2 

= (- 2c + a + 3bw + 5cw 2 )e _w ^ 2 
For this to be E w W requires 

-2c + a + 3 bw + 5cw 2 = E w (a + bw + cw 1 ). 

Looking at the w 2 terms we find that E w = 5 or c = 0. In case E w = 5 we get 
b = 0, a arbitrary, and c = -2a. 

In case c = 0 we need 

a - E w a , 3b = E w b. 

So E w = 3 or b = 0. In case E w = 3 we find a = 0 and b arbitrary. 

In case b- 0 we find E w = 1 and a arbitrary. 

We have found solutions 

W = ae~ w2/1 , E w = 1; W = bwQ~ w2,1 y E w = 3; W = a(l - 2w 2 )e“ w2/2 , E w = 5. 

(c) Taking for X(v), T(y), Z(z) the various functions found in (b) as listed in the 
table by their E values, we get solutions 0 with E = E x + E y + E z . 


X 

1 

1 

1 

1 

1 

1 

3 

5 

3 

3 

Y 

1 

3 

3 

5 

5 

5 

5 

5 

3 

5 

Z 

1 

1 

3 

1 

3 

5 

5 

5 

3 

3 

E 

3 

5 

7 

7 

9 

11 

13 

15 

9 

13 


(d) The E - 3 case is (some multiple of) 0 (v,y,z) = e * 2/2 e ^ 2/2 e z2/2 so 
| 0| 2 = +;y +z \ We know that to normalize this requires a factor of n~ 3 ^ 2 . 
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Symbols 

-C opposite curve to C, 285 
A - B set difference, 86 
C 1 continuously differentiable, 112 
C 2 , 127 
C n , 161 

DF matrix derivative, 109 
0, 18 
A B, 10 

F VG,U VU, 156 
Af Laplacian, 156 

R n , 18 
R,5 
R 2 , 1 

T unit tangent to curve, 285 

dA, 216 

dV, 216 

d”X, 262 

dcr, 313 

In, 52 

V/ gradient, 111 
D, 86 

dD boundary, 86 
E j standard basis vector, 22 

A 

acceleration 
of fluid, 371 
of particle, 192 
angle 

between vectors, 13 
in R n , 27 

antisymmetric, 24 
approximation 
linear, 104 
of integral, 227 
Taylor, 179 


arc length, 279 
parametrization of curve, 281 
area 

in plane, 209 
lower, 208 
of surface, 313 
upper, 209 
average 

of / on a curve, 284 
of / on region, 263 
of / on surface, 320 

B 

ball 

open, 84 
basis 

standard, 28 
bilinear 
function 
R 2 to R, 15 
in R n , 22 
boundary, 86 
bounded 
function, 89 
set, 87 

smoothly bounded set 
in R", 261 
in 3 space, 213 
in plane, 210 

C 

Cauchy-Schwarz inequality, 26, 65 
Chain Rule 
for curves, 119 
general, 125 

second function scalar, 125 
change of variables 
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Index 


in R n , 268 
in R 2 , 245 

in double integral, 246 
in multiple integral, 268 
charge 

conservation, 367 
density, 364 
circulation 

and curl as local rotation, 153 
in plane fluid flow, 290 
closed 

curve, 83, 300 
set, 86 
closure, 86 

complete the square, 173 
complex valued function 

and Schrodinger equation, 416 
Fundamental Theorem of Algebra, 100 
component 
function, 63 
of vector, 1,18 
composite function, 73 
conjugate, 416 
connected, 83, 91, 264 
simply, 361 
conservation 
energy 
of fluid, 371 
of particle, 197 
law, 368 

mass of fluid, 369 
momentum of fluid, 369 
conservative vector field, 300 
and curl, 304, 361 
constraint, 186 
continuity equation, 369 
continuous 
at a point, 79 
compositions, 82 
on a set, 79 

sums, products and quotients, 81 
uniformly, 90 
contours, 67 
convergence 

and continuous function, 79 
of Riemann sums, 227 
of subsequence, 89 
sequence in W 1 , 79 
convex, 396 
coordinates 
cylindrical, 95 
in triple integral, 268 
polar, 94 

in double integral, 249 


spherical, 96 
in triple integral, 272 
cross product, 58 
curl, 149 

conservative vector field, 304, 361 
curve 

helix in R 3 , 69 
in R n , 83 

parametrization, 83 
smooth, 279 
cylinder, 88 

cylindrical coordinates, 268 

D 

deformation, 40 
density 

charge and current, 364 
divergence as flux, 152 
electric charge, 367 
fluid, 369 
linear mass, 279 
mass of material, 205 
population, 206 
probability 
in R 3 , 275 
in R 2 , 257 
dependence 
linear 
in R", 19 
derivative 

directional, 122 
Jacobian, 110 
matrix, 109 

of integral with parameter, 273 
partial, 105 
second, 127, 168 
rules 

Chain, 119, 125 
for div, V, curl, 156 
sums and products, 156 
derivative test 
first, 166, 184 
second, 171, 181 
determinant 

key properties, 33 
of list of vectors, 37 
of matrix, 39 
differentiable, 103, 104 
continuously, 112 
n times, 161 
function R n to R m , 109 
locally linear, 103 
dimension 

^-dimensional space, 18 
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n-dimensional vector, 18 
two-dimensional vector, 1 
directional derivative, 122 
divergence, 148 
Divergence Theorem, 335 

analogue of Fundamental Theorem, 375 
inR 3 , 348 
dot product 
in R n , 25 
in R 2 , 10 

double integral, 226 

E 

Earth 

Moon orbit, 308 
temperature, 407 
eigenvalue, eigenvector, 186 
endpoints, 83 
energy 

internal of fluid, 371 
kinetic 
fluid, 371 
of particle, 197 
of particle in gradient, 194 
potential 
of particle, 197 
enthalpy, 371 

equilibrium and Laplace Equation, 410 
Extreme Value Theorem, 89 
extremum 

constrained, 184 
local, 166 

F 

fluid dynamics, 369 
flux 

across curve in R 2 , 292 
across surface, 321 
through parallelogram, 59 
force 

gravity, 196 

on membrane under pressure, 413 
on string, 388 

on vibrating membrane, 398 
pressure, 155 
frequency, 395 
function 

bilinear, 15, 22 
bounded, 89 
component, 63 
composite, 73 
constant, 64 
continuous 
at point, 79 


curve, 83 

sequence of points converges, 79 
uniformly, 90 
differentiable, 104 
continuously, 112 
domain, 63 
from R n to R m , 63 
from R n to R n vector field, 70 
from R to R n , 69 
harmonic, 411 
implicitly defined, 140 
integrable, 226 
inverse, 135, 140 
level set, 67 
linear, 5, 21, 49 
represented by matrix, 50 
locally linear, 103 
multilinear, 33 
one to one, 63 
onto, 63 
range, 63 

Fundamental Theorem 
of Algebra, 100 
of Calculus, 375 
of Line Integrals, 303 

G 

gradient, 111 
graph of function 
R 3 to R, 68 
R 2 to R, 66 
gravity, 196 
is conservative, 300 
linear approximations to, 77 
Greatest Lower Bound Theorem, v 
Green’s Theorem, 340 

H 

heat equation, 404 

higher dimensions, 406 
helix, 69 

Hessian matrix, 168 
hyperplane, 19 

I 

image, 63 

Implicit Function Theorem, 140 
multivariable, 144 
indefinite matrix, 169, 173 
independence 
linear 
in R”, 19 
in R 2 , 4 
of path, 303 
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of surface, 364 
inequality 

Cauchy-Schwarz, 26 
triangle, 27 
initial data, 391 
integrable 

on smoothly bounded set, 226 
over unbounded set, 254, 258 
integral 

additivity, 219 
approximate, 227 

continuous nonnegative function, 225 
definition, 226 

determined by properties, 234 
double, 226 
iterated, 239, 240 
line 

arc length, 279 
Fundamental Theorem, 303 
of scalar, 283 
of scalar in R n , 293 
of vector, 285 
of vector in R n , 294 
lower and upper bound, 219 
Mean Value Theorem, 264 
of bounded function, 226 
of continuous function 
as difference, 230 
properties, 232 
of scalar over surface, 319 
of vector over surface, 321 
over unbounded set, 254 
Riemann sum, 227 
triple, 264 

variation with parameter, 273 
integrand, 226 
interior, 85 
inverse 
matrix, 53 

Inverse Function Theorem, 135 
in R n , 140 

inverse square vector field, 72 

J 

Jacobian, 110 

in change of variables, 246 
jump, 381 

K 

Kepler, 201 
kinetic energy 
fluid, 371 


L 

Lagrange multiplier, 184 
Laplace 

equation, 411 
operator, 156 
Law of Cosines 
in R n , 28 
in R 2 , 14 

Least Upper Bound Theorem, v 
length 

of vector, 9 
in R n , 25 
level set, 67, 140 
line, 56 
linear 

approximation, 104 
combination 
trivial, 18 

combination of vectors, 18 
in R 2 , 3 
dependence 
in R", 19 
function 
R n to R m , 64 
R n to R, 21 
R 2 to R, 5 

represented by matrix, 50, 65 
function R n to R*, 49 
independence 
in R", 19 
in R 2 , 4 

linearity of integral, 218 
locally, 103 

lower and upper volume in R n , 261 
lower integral, 223 

M 

mass 

conservation law for fluid, 369 
density of fluid, 369 
line integral, 283 
of object, 205 
of particle, 192 
matrix 
n x k, 50 
derivative, 109 
eigenvalue, 186 
Hessian, 168 
identity, 52 
inverse, 53 
norm, 65 

positive definite, 169 
rectangular array, 38 
represents linear function, 50 
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symmetric, 168 
maximum, 89 
local, 166 
Maxwell 

equations, 165, 364 
Mean Value Theorem, 107 
for integrals, 264 
multivariable, 127 
membrane, 398 
minimum 
local, 166 
momentum 

conservation law for fluid, 369 
Monotone Convergence Theorem, v 
Moon 

elliptical orbit, 201 
orbit, 308 

multilinear function, 33 
N 

negative definite matrix, 169, 173 
negatively oriented list of vectors, 40 
neighborhood, 86 
Newton 

Law of Cooling, 409 
Law of Motion, 192 
norm 

matrix, 65 
of vector 
in R n , 25 
in R 2 , 9 
normal 

component of vector field, 321 
outward, 335 
to hyperplane, 57 
to plane, 57 
to tangent plane, 115 
vector to plane curve, 292 

O 

one to one, 52, 63 
onto, 63 
open 
ball, 84 
set, 86 
orbit 

Moon, 201, 308 
Saturn, 201 
orbital, 275 
orientable surface, 323 
orientation 

preserved by mapping, 246 
reversal of curve, 285 
oriented 


list of vectors, 40 
orthogonal, 13 

orthonormal set of vectors, 28 

P 

pairwise orthogonal, 28 
parametrization 
of curve, 83 

of curve by arc length, 281 
of surface, 310 

partial differential equation, 162 
pde, 162 

fluid energy, 372 
fluid mass, 369 
fluid momentum, 370 
heat equation 
ID, 403 
2D, 406 
Laplace, 411 
Maxwell, 165, 364 
membrane 
under pressure, 413 
vibration, 398 
Schrodinger, 416 
wave equation 
ID, 389 
2D, 399 
period, 395 
permutation 
signature, 36 
piecewise smooth 
curve, 280 
plane, 56 
point 

interior, 85 
polar coordinates, 94 
positive definite matrix, 169, 173 
determinant test, 182 
positively oriented list of vectors, 40 
potential, 300 

energy of particle, 197 
vector, 158, 366 
pressure 
force, 155 
probability 

and Schrodinger’s equation, 417 
density for electron, 275 
density in R 3 , 275 
density in R 2 , 257 
normal distribution, 256 
product 

cross in R 3 , 58 
dot 

in R n , 25 
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in R 2 , 10 
of matrices, 51 

product rule for div, V, curl, 156 
Pythagorean Theorem 
in W 1 , 31 
in R 2 , 9 

R 

regular 

set in R 3 , 348 
Riemann sum 

two variables, 227 

S 

saddle point, 172 
scalar, 2 

Schrodinger equation, 416 
second derivative test, 181 
sequence converges, 79 
set 

boundary, 86 
bounded, 87 
closed, 86 
closure, 86 

connected, 83, 91, 264 
open, 86 
regular, 338 
regular in R 3 , 348 
simply connected, 361 
smoothly bounded 
in R n , 261 

smoothly bounded in R 2 , 210 
shock wave, 382 
signature of permutation, 36 
simple 

x simple set, 238 
y simple set, 239 
simplex, 44 
simply connected, 361 
smooth 

change of variables, 268 
curve, 279 
in R n , 293 

parametrization, 279 
surface, 310 
smoothly bounded set 
in R n , 261 
in R 2 , 210 
sound 
wave, 374 
space 

n-dimensional, 18 
two-dimensional, 1 
span of vectors, 18 


spherical coordinates, 272 
standard basis, 28 
stereographic projection, 76 
Stokes’ Theorem, 356 
string 

tied at ends, 394 
vibration, 387 
surface, 310 
area, 313 
flux across, 321 
integral of scalar, 319 
integral of vector, 321 
orientable, 323 
oriented, 323 
piecewise smooth, 320 
smooth, 310 
tangent plane, 115 
symmetric, 24 
symmetric matrix, 168 
determinant test, 182 
system 

and Implicit Function Theorem, 144 
of linear equations, 8 


tangent 

plane to graph, 115 
plane to surface, 321 
vector to curve, 285 
Taylor approximation, 179 
first order, 177 
second order, 178 
Theorem 

Chain Rule, 119 
in R", 120 

matrix derivatives, 125 
Change of Variables in integral, 246 
continuity of 

differentiable scalar function, 104 
differentiable vector function, 109 
Divergence 
in R 3 , 348 
in R 2 , 338 
Extreme Value, 89 
Fundamental 
of Algebra, 100 
of Calculus, 333 
of Line Integrals, 303 
Greatest Lower Bound, v 
Green’s, 340 
Implicit Function, 140 
multivariable, 144 
Intermediate Value, 91 
Inverse Function 
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R 2 , 135 
in R", 140 

Least Upper Bound, v 
Mean Value, 107 
for integrals, 264 
multivariable, 127 
Monotone Convergence, v 
Stokes’, 356 
Taylor, 177, 179 
triangle inequality, 27 
triple integral, 264 

U 

unit 

tangent to curve, 285 
unit vector, 9 
upper integral, 223 


vector 

n-dimensional, 18 
component, 1 

deformation of ordered list, 40 
linear combination, 18 
norm 
in R n , 25 
in R 2 , 9 

oriented list, 40 
potential, 158, 366 
span, 18 
sum, 1 

sum in R n , 18 
two dimensional, 1 


unit, 9 

i,j\k, 72 

% 28 

tangent to curve, 285 
zero, 2 

vector field, 70 
conservative, 300 
inverse square, 72 
velocity 
fluid, 369 
of particle, 111 
vibration 

of membrane, 398 
of string, 395 
volume, 261 
in W 1 , 261 
signed, 45 
in R 3 , 213 

volumetric flow rate, 59 

W 

wave 

electromagnetic, 165 
equation 
ID, 389 
2D, 399 
shock, 382 
sound, 374 
traveling, 401 
work 

and conservative vector field, 300 
fluid pressure, 371 
integral on curve, 285 


